Docket No. CL001201DIV
Application Serial No. 10/644,021 
Inventors: Ming-Hui WEI et al. 
Title: ISOLATED HUMAN ENZYME PROTEIN... 

1 GCGCCTGGGG ACCGCAGAGG TGAGAGTCGC GCCCGGGAGT CCGCCGCCTG 

51 CGCCAGGATG GAGTTCGTGA AATGCCTTGG CCACCCCGAA GAGTTCTACA 

101 ACCTGGTGCG CTTCCGGATC GGGGGCAAGC GGAAGGTGAT GCCCAAGATG 

151 GACCAGGACT CGCTCAGCAG CAGCCTGAAA ACTTGCTACA AGTATCTCAA 

201 TCAGACCAGT CGCAGTTTCG CAGCTGTTAT CCAGGCGCTG GATGGGGAAA 

251 TGCGCAACGC AGTGTGCATA TTTTATCTGG TTCTCCGAGC TCTGGACACA

301 CTGGAAGATG ACATGACCAT CAGTGTGGAA AAGAAGGTCC CGCTGTTACA 

351 CAACTTTCAC TCTTTCCTTT ACCAACCAGA CTGGCGGTTC ATGGAGAGCA 

401 AGGAGAAGGA TCGCCAGGTG CTGGAGGACT TCCCAACGTA CTGCCACTAT 

451 GTTGCTGGGC TGGTCGGAAT TGGCCTTTCC CGTCTTTTCT CAGCCTCAGA 

501 GTTTGAAGAC CCCTTAGTTG GTGAAGATAC AGAACGTGCC AACTCTATGG 

551 GCCTGTTTCT GCAGAAAACA AACATCATCC GTGACTATCT GGAAGACCAG 

601 CAAGGAGGAA GAGAGTTCTG GCCTCAAGAG GTTTGGAGCA GGTATGTTAA 

651 GAAGTTAGGG GATTTTGCTA AGCCGGAGAA TATTGACTTG GCCGTGCAGT 

701 GCCTGAATGA ACTTATAACC AATGCACTGC ACCACATCCC AGATGTCATC 

751 ACCTACCTTT CGAGACTCAG AAACCAGAGT GTGTTTAACT TCTGTGCTAT 

801 TCCACAGGTG ATGGCCATTG CCACTTTGGC TGCCTGTTAT AATAACCAGC

851 AGGTGTTCAA AGGGGCAGTG AAGATTCGGA AAGGGCAAGC AGTGACCCTC 

901 ATGATGGATG CCACCAATAT GCCAGCTGTC AAAGCCATCA TATATCAGTA 

951 TATGGAAGAG ATTTATCATA GAATCCCCGA CTCAGACCCA TCTTCTAGCA 

1001 AAACAAGGCA GATCATCTCC ACCATCCGGA CGCAGAATCT TCCCAACTGT 

1051 CAGCTGATTT CCCGAAGCCA CTACTCCCCC ATCTACCTGT CGTTTGTCAT 

1101 GCTTTTGGCT GCCCTGAGCT GGCAGTACCT GACCACTCTC TCCCAGGTAA 

1151 CAGAAGACTA TGTTCAGACT GGAGAACACT GATCCCAAAT TTGTCCATAG 

1201 CTGAAGTCCA CCATAAAGTG GATTTACTTT TTTTCTTTAA GGATGGATGT 

1251 TGTGTTCTCT TTATTTTTTT CCTACTACTT TAATCCCTAA AAGAACGCTG 

1301 TGTGGCTGGG ACCTTTAGGA AAGTGAAATG CAGGTGAGAA GAACCTAAAC 

1351 ATGAAAGGAA AGGGTGCCTC ATCCCAGCAA CCTGTCCTTG TGGGTGATGA 

1401 TCACTGTGCT GCTTGCGGCT CATGGCAGAG CATTCAGTGC CACGGTTTAG

1451 GTGAAGTCGC TGCATATGTG ACTGTCATGA GATCCTACTT AGTATGATCC

1501 TGGCTAGAAT GATAATTAAA AGTATTTAAT TTGAAAAAAA AAAAAAAAAA 

1551 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

1601 AAAAAA (SEQ ID NO:1)




FEATURES:

5'UTR: 1-57 

Start Codon: 58 

Stop Codon: 1180 

3'UTR: 1183 
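
The coordinates above can be checked against the listing itself. The following is a minimal sketch, assuming 1-based positions as printed; it uses only the first 100 bases of SEQ ID NO:1, and the names `CDNA_PREFIX`, `CODON_TABLE` and `translate_from` are illustrative, not part of the original record. The codon table is deliberately partial and covers only the codons that occur in this fragment.

```python
# First 100 bases of the cDNA listing (SEQ ID NO:1), copied from the rows above.
CDNA_PREFIX = (
    "GCGCCTGGGGACCGCAGAGGTGAGAGTCGCGCCCGGGAGTCCGCCGCCTG"
    "CGCCAGGATGGAGTTCGTGAAATGCCTTGGCCACCCCGAAGAGTTCTACA"
)

# Partial codon table: only the codons present in this fragment (assumption:
# standard genetic code).
CODON_TABLE = {
    "ATG": "M", "GAG": "E", "TTC": "F", "GTG": "V", "AAA": "K",
    "TGC": "C", "CTT": "L", "GGC": "G", "CAC": "H", "CCC": "P",
    "GAA": "E", "TAC": "Y",
}

START_CODON_POS = 58   # "Start Codon: 58"
STOP_CODON_POS = 1180  # "Stop Codon: 1180"


def translate_from(seq, start_1based, n_codons):
    """Translate n_codons codons beginning at a 1-based start position."""
    offset = start_1based - 1  # convert to a 0-based index
    codons = [seq[offset + 3 * i: offset + 3 * i + 3] for i in range(n_codons)]
    return "".join(CODON_TABLE[codon] for codon in codons)


start_codon = CDNA_PREFIX[START_CODON_POS - 1:START_CODON_POS + 2]
peptide = translate_from(CDNA_PREFIX, START_CODON_POS, 14)

# Number of sense codons between the start codon and the stop codon.
coding_codons = (STOP_CODON_POS - START_CODON_POS) // 3

print(start_codon, peptide, coding_codons)
```

The translated prefix agrees with the first residues of the protein listing, and the codon count agrees with its 374-residue length.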



Homologous proteins: 

Top 10 BLAST Hits 



CRA|108000024649260 /altid=gi|12734163 /def=ref|XP_005134.2| fa...
CRA|18000004925908 /altid=gi|4758350 /def=ref|NP_004453.1| farn...
CRA|18000004929946 /altid=gi|2135096 /def=pir||I38245 farnesyl-...
CRA|18000004993865 /altid=gi|2136196 /def=pir||I52090 squalene...
CRA|18000004932414 /altid=gi|6753838 /def=ref|NP_034321.1| farn...
CRA|18000004937535 /altid=gi|9506591 /def=ref|NP_062111.1| farn...
CRA|1000682330885 /altid=gi|6002565 /def=gb|AAF00038.1| (AF0903...
CRA|335001098694081 /altid=gi|11514495 /def=pdb|1EZF|A Chain A,...
CRA|18000005103884 /altid=gi|2463565 /def=dbj|BAA22557.1| (AB00...
CRA|18000005103885 /altid=gi|7434086 /def=pir||T00489 farnesyl-...

BLAST dbEST hits:

                                              Score   E
gi|12926380 /dataset=dbest /taxon=960...      1441    0.0
gi|12945082 /dataset=dbest /taxon=960...      1370    0.0
gi|12921315 /dataset=dbest /taxon=960...      1346    0.0
gi|11642571 /dataset=dbest /taxon=96...       1330    0.0
gi|9141948 /dataset=dbest /taxon=9606...      1281    0.0
gi|13040072 /dataset=dbest /taxon=960...      1233    0.0
gi|12944143 /dataset=dbest /taxon=960...      1055    0.0



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

From BLAST dbEST hits: 

gi|12926380 placenta

gi|12945082 T cells from T cell leukemia

gi|12921315 Fetal brain

gi|11642571 pancreas

gi|9141948 Burkitt lymphoma

gi|13040072 bladder



From tissue screening panels: 
Whole liver 

1 MEFVKCLGHP EEFYNLVRFR IGGKRKVMPK MDQDSLSSSL KTCYKYLNQT

51 SRSFAAVIQA LDGEMRNAVC IFYLVLRALD TLEDDMTISV EKKVPLLHNF 

101 HSFLYQPDWR FMESKEKDRQ VLEDFPTYCH YVAGLVGIGL SRLFSASEFE 

151 DPLVGEDTER ANSMGLFLQK TNIIRDYLED QQGGREFWPQ EVWSRYVKKL 

201 GDFAKPENID LAVQCLNELI TNALHHIPDV ITYLSRLRNQ SVFNFCAIPQ 

251 VMAIATLAAC YNNQQVFKGA VKIRKGQAVT LMMDATNMPA VKAIIYQYME 

301 EIYHRIPDSD PSSSKTRQII STIRTQNLPN CQLISRSHYS PIYLSFVMLL 

351 AALSWQYLTT LSQVTEDYVQ TGEH (SEQ ID NO: 2) 



FEATURES:

Functional domains and key regions:

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

Number of matches: 2 

1 48-51 NQTS (residues 48-51 of SEQ ID NO:2) 

2 239-242 NQSV (residues 239-242 of SEQ ID NO:2) 

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site 

Number of matches: 5 

1 39-41 SLK 

2 50-52 TSR 

3 158-160 TER 

4 313-315 SSK 

5 322-324 TIR 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 81-84 TLED (residues 81-84 of SEQ ID NO:2) 

2 145-148 SASE (residues 145-148 of SEQ ID NO:2) 

3 147-150 SEFE (residues 147-150 of SEQ ID NO:2) 

[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 2 

1 137-142 GIGLSR (residues 137-142 of SEQ ID NO: 2) 

2 276-281 GQAVTL (residues 276-281 of SEQ ID NO: 2) 

[5] PDOC00009 PS00009 AMIDATION
Amidation site 

22-25 GGKR (residues 22-25 of SEQ ID NO: 2) 

[6] PDOC00802 PS01044 SQUALEN_PHYTOEN_SYN_1
Squalene and phytoene synthases signature 1

128-143 YCHYVAGLVGIGLSRL (residues 128-143 of SEQ ID NO: 2)

[7] PDOC00802 PS01045 SQUALEN_PHYTOEN_SYN_2 
Squalene and phytoene synthases signature 2 

164-189 MGLFLQKTNIIRDYLEDQQGGREFWP (residues 164-189 of SEQ ID NO: 2) 
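
The short-motif hits listed above can be reproduced with a plain pattern scan. The sketch below assumes the PROSITE patterns N-{P}-[ST]-{P} for PS00001 and [ST]-x-[RK] for PS00005, rewritten by hand as regular expressions; `PROTEIN`, `PATTERNS` and `scan` are illustrative names, and this is not the tool that produced the original annotation.

```python
import re

# SEQ ID NO:2, copied from the protein listing above (374 residues).
PROTEIN = (
    "MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQT"
    "SRSFAAVIQALDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNF"
    "HSFLYQPDWRFMESKEKDRQVLEDFPTYCHYVAGLVGIGLSRLFSASEFE"
    "DPLVGEDTERANSMGLFLQKTNIIRDYLEDQQGGREFWPQEVWSRYVKKL"
    "GDFAKPENIDLAVQCLNELITNALHHIPDVITYLSRLRNQSVFNFCAIPQ"
    "VMAIATLAACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAVKAIIYQYME"
    "EIYHRIPDSDPSSSKTRQIISTIRTQNLPNCQLISRSHYSPIYLSFVMLL"
    "AALSWQYLTTLSQVTEDYVQTGEH"
)

# Hand-translated PROSITE patterns (assumed forms, see lead-in).
PATTERNS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",  # N-{P}-[ST]-{P}
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",       # [ST]-x-[RK]
}


def scan(pattern, seq):
    """Return (start, end, match) tuples with 1-based inclusive coordinates.

    The pattern is wrapped in a lookahead so overlapping sites are reported.
    """
    hits = []
    for m in re.finditer("(?=(" + pattern + "))", seq):
        text = m.group(1)
        hits.append((m.start() + 1, m.start() + len(text), text))
    return hits


glyco_sites = scan(PATTERNS["ASN_GLYCOSYLATION"], PROTEIN)
pkc_sites = scan(PATTERNS["PKC_PHOSPHO_SITE"], PROTEIN)

for name, sites in (("ASN_GLYCOSYLATION", glyco_sites),
                    ("PKC_PHOSPHO_SITE", pkc_sites)):
    print(name, sites)
```

Under these assumptions the scan reports the same positions as entries [1] and [2] above.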

Membrane spanning structure and domains:

BLAST Alignment to Top Hit:

>CRA|18000004925908 /altid=gi|4758350 /def=ref|NP_004453.1|
farnesyl-diphosphate farnesyltransferase 1;
Farnesyl-diphosphate farnesyltransferase 1 (squalene
synthase); Squalene synthase [Homo sapiens] /org=Homo
sapiens /taxon=9606 /dataset=nraa /length=417
Length = 417

Score = 743 bits (1898), Expect = 0.0

Identities = 374/417 (89%), Positives = 374/417 (89%), Gaps = 43/417 (10%)


Query: 1   MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQTSRSFAAVIQA
Sbjct: 1

Query: 61  LDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNFHSFLYQPDWRFMESKEKDRQ
Sbjct: 61

Query: 121 VLEDFPT-------------------------------------------YCHYVAGLVG
Sbjct: 121

Query: 138 IGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNIIRDYLEDQQGGREFWPQEVWSRYV
Sbjct: 181

Query: 198 KKLGDFAKPENIDLAVQCLNELITNALHHIPDVITYLSRLRNQSVFNFCAIPQVMAIATL
Sbjct: 241

Query: 258 AACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAVKAIIYQYMEEIYHRIPDSDPSSSKTR
Sbjct: 301

Query: 318 QIISTIRTQNLPNCQLISRSHYSPIYLSFVMLLAALSWQYLTTLSQVTEDYVQTGEH
Sbjct: 361

>CRA|108000024649260 /altid=gi|12734163 /def=ref|XP_005134.2|
farnesyl-diphosphate farnesyltransferase 1 [Homo
sapiens] /org=Homo sapiens /taxon=9606 /dataset=nraa
/length=431
Length = 431

Score = 743 bits (1898), Expect = 0.0 

Identities = 374/417 (89%), Positives = 374/417 (89%), Gaps = 43/417 (10%) 

Query: 1   MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQTSRSFAAVIQA 60
           MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQTSRSFAAVIQA
Sbjct: 15  MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQTSRSFAAVIQA 74

Query: 61  LDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNFHSFLYQPDWRFMESKEKDRQ 120
           LDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNFHSFLYQPDWRFMESKEKDRQ
Sbjct: 75  LDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNFHSFLYQPDWRFMESKEKDRQ 134

Query: 121 VLEDFPT-------------------------------------------YCHYVAGLVG 137
           VLEDFPT                                           YCHYVAGLVG
Sbjct: 135 VLEDFPTISLEFRNLAEKYQTVIADICRRMGIGMAEFLDKHVTSEQEWDKYCHYVAGLVG 194

Query: 138 IGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNIIRDYLEDQQGGREFWPQEVWSRYV 197
           IGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNIIRDYLEDQQGGREFWPQEVWSRYV
Sbjct: 195 IGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNIIRDYLEDQQGGREFWPQEVWSRYV 254

Query: 198 KKLGDFAKPENIDLAVQCLNELITNALHHIPDVITYLSRLRNQSVFNFCAIPQVMAIATL 257
           KKLGDFAKPENIDLAVQCLNELITNALHHIPDVITYLSRLRNQSVFNFCAIPQVMAIATL
Sbjct: 255 KKLGDFAKPENIDLAVQCLNELITNALHHIPDVITYLSRLRNQSVFNFCAIPQVMAIATL 314

Query: 258 AACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAVKAIIYQYMEEIYHRIPDSDPSSSKTR 317
           AACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAVKAIIYQYMEEIYHRIPDSDPSSSKTR
Sbjct: 315 AACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAVKAIIYQYMEEIYHRIPDSDPSSSKTR 374

Query: 318 QIISTIRTQNLPNCQLISRSHYSPIYLSFVMLLAALSWQYLTTLSQVTEDYVQTGEH 374 (SEQ ID NO:2)
           QIISTIRTQNLPNCQLISRSHYSPIYLSFVMLLAALSWQYLTTLSQVTEDYVQTGEH
Sbjct: 375 QIISTIRTQNLPNCQLISRSHYSPIYLSFVMLLAALSWQYLTTLSQVTEDYVQTGEH 431 (SEQ ID NO: 5)

>CRA|18000004929946 /altid=gi|2135096 /def=pir||I38245
farnesyl-diphosphate farnesyltransferase (EC 2.5.1.21),
hepatic - human /org=human /taxon=9606 /dataset=nraa
/length=417
Length = 417

Score = 741 bits (1893), Expect = 0.0 

Identities = 373/417 (89%), Positives = 373/417 (89%), Gaps = 43/417 (10%) 

MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQTSRSFAAVIQA 60
MEFVKCLGHPEEFYNLVRFRIGGKRKVMPKMDQDSLSSSLKTCYKYLNQTSRSFAAVIQA

LDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLLHNFHSFLYQPDWRFMESKEKDRQ

VLEDFPT                                           YCHYVAGLVG

IGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNIIRDYLEDQQGGREFWPQEVWSRYV

KKLGDFAKPENIDLAVQCLNELITNALHHIPDVITYLSRLRNQSVFNFCAIPQVMAIATL

AACYNNQQVFKGAVKIRKGQAVTLMMDATNMPAVKAIIYQYMEEIYHRIPDSDPSSSKTR

QIISTIRTQNLPNCQLISRSHYSPIYLSFVMLLAALSWQYL TLSQVTEDYVQTGEH
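
The percentage figures in the alignment summaries above follow from simple arithmetic on the reported counts. The sketch below assumes the percentages are truncated to whole numbers rather than rounded, which is an assumption that happens to reproduce the printed values; `truncated_percent` and the constant names are illustrative.

```python
def truncated_percent(count, total):
    """Whole-number percentage, truncated rather than rounded (assumption)."""
    return (100 * count) // total


QUERY_LEN = 374   # residues in SEQ ID NO:2
ALIGN_LEN = 417   # denominator reported in the Identities/Gaps lines
GAPS = 43         # gap columns reported for each hit

# Identities for the 743-bit hits and for the 741-bit hit, then the gap share.
print(truncated_percent(374, ALIGN_LEN))
print(truncated_percent(373, ALIGN_LEN))
print(truncated_percent(GAPS, ALIGN_LEN))

# The query length plus the gap columns accounts for every alignment column.
print(QUERY_LEN + GAPS == ALIGN_LEN)
```

Read this way, the 43 gap columns correspond to the 43-residue stretch present in the subject sequences but absent from SEQ ID NO:2 between residues 127 and 128.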

Hmmer search results (Pfam):

Model    Description                       Score   E-value  N
PF00494  Squalene and phytoene synthases   425.8   4e-124   2

Parsed for domains:

Model    Domain  seq-f  seq-t     hmm-f  hmm-t     score  E-value
PF00494  1/2        47    126 ..      1     88 [.  124.6  1.1e-33
PF00494  2/2       127    291 ..    146    317 .]  301.1  1.3e-86

1 TATTTATTCC TAATTAAATG GGGAGGAAAG TCTTTGAAGA GGAACCTCTA
51 CTTTACTTTT TATACCGTCA TGGCTGGAAA CTAAGTTTTT AAGATTTTTC 
101 TGGGGTTCCC TTGGCCGAGG TGGGGAGTGG GAGGGCTGTC CAGTGGTAGG 
151 GACTTAGGAT TTTTAGTTTA CAGTAGTAGG GGAAACACTC TGTAATCTAA 
201 TACATAAGTA AATGATGTAT TAGAATATGG TAAATATAGG CAAGTAGACC 
251 CCCACTGGGA TTAGCAGTGG TGGAAATGTG AGAGAGGGCA AACAGGTGGG 
301 TCTAGATGAG GTGTGAGCAG ACTCGAGGGG CACAGGAGTT AGTCAAGCCA 
351 GTATCTGGGG GATAGTGCAG GAATAGTGAA CAGCTAGACA AAAAGTCCTA 
401 GGGCCAGAGA AAGCAAAAGC ATAAGAGATG GAGGCCAGAG AGGTAATCTG 
451 GGTGGAAGGC TGCAGCCTCT CAGGATCCCT ATAGGTGCTT TGGCTTTTGT
501 TGGAGAGACA CTGAACAGCT TTGGGCAGTG AACGTACCTG ACAGGTTTCC 
551 TGTTTGTTTT TGAGATGAAG TCTCGCTCTT GTCCCCCAGG CTGGAGTGCA 
601 ATAGCGCGAT CTCAGCTCAC TGCAACCTCT GCCTCCTGTG TTCAAGCGAT 
651 TCTCCTGCCT CAGCCTCCCA GGTAGCTGGG ATTATAGGCG CCTGCCACCA 
701 TGCCTGGCTA ATTTTTGTAT TTTTAGTAGA GACGCAGTTT CAGCATGTTG 
751 GCCAGGCTGG TCTTGAACTC CAGACCTCAG GTGATCCGCC CGCCTTGGCC 
801 TCCCAAAGTG CTGGGATTAC AGGCGTGAGC CACCGCGCTC GGCTAGACCT 
851 GACAGGTTTT AAAAGGATTA CTGGTTGCTG TGTTAAAACA GACTGCAGGA 
901 TGGCTTAGGT AGCCAGTAGG TTTTTTTTTT TTTTGGAGAC GTAGTCTTGC 
951 TCTGTTGGCC TGGCTGGAGT GCAGCGGTGT CATCTTGGCT CACTGCAAAC 
1001 TCCGCTTCCC GGGTTCAAGT GATTCTCCTG CCTCAGCCTC CGGAGTAGTT 
1051 GGGACTACAG GCGCCCACCA CCACACTCGG CTTTTTTGTA TTTTTAGTAG 
1101 AGACGGGTTT CACCATGTTG GCCAGGATGG TCTCGATCTC TTGACCTCGT 
1151 GATCCACCCG CCTTGGCCTC CCAAAGTGTT GCGATTACAG GCGTGAGCCA
1201 CCACGCCTGG ACGGGTAGCC AGTAGTTTCT AGGGCTGGAG AGATCTAGGA 
1251 TGAGAGAAGT TTCCACATTC CTGTTACAGG CTCTCTAAGG CTTCAGCTCC 
1301 TTTTTCTAGG ACTAAGCTGG ATCTCAAGTA AACACTAGAG AGGGGGCAGC 
1351 TGAAGCTCCA GGAGTGTGTG GGGCTCCCTG GGGCTGGATG GCGGTGGCGG 
1401 GCAGGCGAGC TGGGCTGTGC TCGGGTGTGT TACAGTAAAG ACGCCCAGCT 
1451 TGGCGCTGGC CCGGCCTTTT CACGGTTTTA GGCTCTACAG AGAGCGGCTG 
1501 CAGAGCTCAC CCGGCTGGCA GGAGCCACCG AGGCCGGACA CGTGGGCGAC 
1551 TTATTGACCA AGTGGGGAGG AAGCAGCCCC GCACTGCTCT CCCGACTGCG 
1601 GACCACCGTT GGGCTCATGC GCATCATAAG CCCCACCGCC TCACCTCCAG 
1651 TCCCCACAGC GTTCGCGCTC CCAGCCGGGG TAAGCGGAAG AAAACAAAGG 
1701 CCCGGCTCCA TCAGGGCACC AATCCCGCTC GTCGGCCTCT TTCTCGGCCT 
1751 CCAATGAGCT TCTAGGGTGT TATCACGCCA GTCTCCTTCC GCGACTGATT 
1801 GGCCGGGGTC TTCCTAGTGT GAGCGGCCCT GGCCAATCAG GCGCCCGTCA 
1851 GCCCACCCCA CGAGGCCGCA GCTAGCCCCG CTGGCGGCCG AGGCCGGTTG 
1901 AAGTGGGCGG AGCGGCGGGC GGGGCGTCGC CGTACTAGGC CTGCCCCCTG 
1951 TCCGGCCAGC CCCTCGAAGC ACCTACTCCA CAGGTCCAGC CGGCCGGTGA 
2001 GCGCCTGGGG ACCGCAGAGG TGAGAGTCGC GCCCGGGAGT CCGCCGCCTG 
2051 CGCCAGGATG GAGTTCGTGA AATGCCTTGG CCACCCCGAA GAGTTCTACA 
2101 ACCTGGTGCG CTTCCGGATC GGGGGCAAGC GGAAGGTGAT GCCCAAGATG 
2151 GACCAGGTGG GCCGAGCCTC CCTGCTTGCC CGGGGCGGGG AAGGAGCTCG 
2201 CTGGGCCGGC CTCAGGGCCT GAGCGGCCGG GCCCGGATCT GGGGCAAGGG 
2251 GCGCGGCGAG CAGGGCCGAC GCCTGGGTGT TCCCGTCCCC CTTTCCTCGA 
2301 GCCTTCCCCC TGTAGGGCCC GGGTGGACGC GGCCGTCCTG GCTGACCTGT 
2351 CCCTGCCCCC GCAAGCCGCC CTGGGCATGA GCGACTJTTG CGTGGTTCCC 
2401 GGTGGTTGCG CTCCCCGTTT CGTCCCCTCC GTGAGCATCG GCGCTTACCG
2451 GTATTTTAAC CCGAGGGTTA CACATCTGAG GCAATGTGGG TGGGTTACGC 
2501 GGGAGAGGAC GAGTGAGTTT TTTGGTAAGC GGAATGAACT ATGCAGATAA 
2551 CATCACATGA AGGCCGTTTC TGGAATGAAG TCTGACTCCT CCAGTTTCAC 
2601 CACCTCTTCC GGAGCTCTCC CCGCCTTGCT GCCTTCCATC GCTTCATCCT
2651 CGGTGCTTCC TGAGTTTTAA AATCGCCTAT CTACGCTTCC AAGTTCCAAT
2701 GAGTTATCTA ACGTCTATGG ATTAGCTAGG TGGTTGGTGG AAGGTCAGAA 
2751 CTTGGTTTTA CTTAGATTTT TATCTGCCTC ATGCCTGTAC TATTTGTTTA
2801 ATGAATGCAT AGGAGGTGTT TTTATTCCAA CAAGAAAATT ATTCGTACGC 
2851 GATTATTGAA TGAATAGACA AATTCAGCCA AGTTCTTCTG GTCTGGACCA 
2901 GCCTGGCTGA TTTCTGTAAC TTTTTTGGGC CAACAGGACA GTAGCAAATG 
2951 TGACTCAGGC CGAGGCTTGA TAGGTGCCTG AACATCGGAG TCTTTCTTTC
3001 AGTGTCCATG TGCTTCAGTA AACACACTAG AAAATAAATT TCTGGTTTTT 
3051 GTCCCCAGTA GACTACACCC TCATTTGGTG TTATTTTTCA CGTGCTATCT
3101 TTAATACAGG TACATCCTTC AGTCTATTTG TAGAACATTC AGTTTTCTTC
3151 ATCTTTTCTT TGCCGGTGCT ACATTATTTG AATTATTTTG CTACAGAATA 
3201 ACTTCTATTA TTTGATATGG CAGATGTCAC TTTTTATATT TAGATATAGC 
3251 ATTCATTTAT TTAACAAATA TTTGACGACC AGTTGTATAT CAGATAGTGT 
3301 TCTAGGTGCT GGAGGTACAA CAGTGAACAA GCTAGGTGAA GACCTTGATT 
3351 TTATAAAACT TACTTTTTAG TGGAAGAGAG ACAATTTAAA AAAGCGAATG 
3401 TACAGTTTTT CACGTGGAGA AAAGCACTGC AGAGGAAGAT ACTAGCAGGG 
3451 CAAGGGATCT GAGTGCAGTC AGACCTCATT TGGGTCCAGA CTTCATTCCT 
3501 CTATGTCTCT TTCCTTTCTA CAGAAAGACT GTTAGAGAAA ATGGTAGCAT 
3551 TGGTTTCCTG TTGGGAGGGA AAGTGGGTGG TCATGGTAAG TGGGTAGAGA 
3601 AAGACTTCAC AGTATACTGT TTTTGTACAT TTTGAGTTTT TTTAAAAGCG 
3651 AGACTTGAGC TATTCTAGCT CTGATAATAT GGTGCAGTAT TTGTTATGTT 
3701 AGTTGTAGTC TTTCTGGGCA GTTTTTACAT CCCCATGAGC CGTTAAAAAA
3751 ATACCTGAAC CTTTAATTAG GGGAAATAAA TTGGAAAAAT ACATTTCCCT 
3801 TCACTTAACA TTATCTTAGT TTCTCTTTTT TTTTTTTTTT TTTTTTGAGA 
3851 TGGAGTCTTG CTCTGTTACC CAGGCTGGAG TGCAGTGGTG GCGGGACCTC 
3901 AGCTAGATGC AGCCTCCGCC TCCTGGGTTC AAGCAATTCT CCTGCCTCAG 
3951 CCTGCTGAGT AGCTGGGATT ACAGGCACCT GCCACTACGC CCGGCTGATT 
4001 TTTTGGTATT TTTAGTAGAG ACGGGGTTTC ACCATGTTGG CGAGGCTGGT 
4051 TTTGAACTCT TGACCTCAAG TGATCTGCTC GCCTTGGTCT CCCAAAGTGC 
4101 TAGGATTACA GGCGTGAGCC ACTGCACCCG GCCTTTTTTT TTTTTTTTTT 
4151 GAGGGGGGGG TCTCACTCCA TCGTCCAGGC TAGAATGCTG TGGCCTGAAC 
4201 ATGACTCACT CCAGTTTTGA CTTCCTTGGC TGAAGCCATC CTCCCACCTC 
4251 GGCTTCCTGA TCCCGAGTAG CTGGGACTCC AGGCACGTGT CACCAATGCA 
4301 TGGCTAATTT TTAAATTTTT TTGTAGACAC AATGTCTCGC TGCATTGCCC 
4351 AGGCTGGTCT TGAACTCCTG AGCTCAAGCG ATTTTCCCAC CTCAGCCTTC
4401 AAAGTGCTGG GATTACAGGT GTGAGCCACT GCACCCAACC AGTTTCTCTC 
4451 TGCAAACTAG GGAAAAAATT TACGCTTAGC AGATATTGAG GGCTGATTAT
4501 TTCTATCACA GAAGCATTTG GCTATAGAAT TTCAGGGTTT AGTAAACTTG 
4551 ATTTACACTG AATTTTTAGG TGCATATCAG TAAATCTACG GGCATATGCC
4601 GCCTGCAAGT TGTGTGGCAT CACCCAAAAG CCGAGAGTAA TGGAAAGAGC
4651 AGGCTGTTAG TAATCAGGCA GATCTGGCTC CTGTCCAATC TAAATCCTGT
4701 TATTTAGACT AATATCTTAA GTCTGTTATT AAGTCCGATT TCTGACGCTA 
4751 TTAAGTTAGG TGAACAACCT TGGTAACTTA ACCTCTGAAC CACAGTTACT 
4801 TCATCTGTAA AATAGGGATG TATGTATGGT AACGATTTTT TAACCACAAC 
4851 TTCCCAACTC TAAGATGGTC TGAAAAGAAT TTTTTGAGTG TTTGGCTCAG 
4901 AATCACTTGG CAGCAAAACC TGACTTGAAG TTGAGGCTTC ATTCATCCCA
4951 CTTAGTATAT TCAAATGTTT TGCTAAAGAA ATAATTATGA GGTGCTACTT 
5001 CACACTGACT AGGGTTGTAT ATGCATTTTA TTGCCTATTT TCTAAAACAC 
5051 TAAAAATGCT AAATTCTGCC CCAGGTCTTG CCACAGATGT TTCAGTGGAC 
5101 TATGGGCCTG TGAGACCTTA AAGGGTTGAT TGAGTAAGGA TCACAGGTGA 
5151 TGTCCGCATT GTGCTTGGCA TGGAGTTAAG TGCTTGATAA ATGGTGGTTA 
5201 TCAATCTGAT TATGTAAATT TATGTAAATT CAGTTCTCAA GTTTGTGGTT 
5251 TTTTTCCCCT CCTGGAGAAA TCTATTCTAT TTTAAAGTGA GGAAGGCTCC 
5301 GTGGAGGGCT GGTAGCTGGT AGCTGTTCAC TTGTGGAACT TTCAGCCTGA 
5351 GGCTGGAGCC CCTTCCTGGG AGTCTGGTCT TGTCGTCTTC CTGACCACCC 
5401 CCACACCCTT CCTCTAAATT CCCTCCATCC CTGTTTTTCT CCCGCTTGCG 
5451 AGCTTTTGGG AGTGTGCTGA ATCTCAGACT GCAATAGATA AACCCAAGAG
5501 GGACAGGCAC CAGTAGCCTG AGCTTGCTTT CTCCCCTGGC TCATGGGAAT 
5551 CAAGCAGTAG AAATTTTTAG TGAGTGTTGT TTTCCATAGT ATGCTTACTA 
5601 GTTGTGTCTT CCTGTTTTGT TCTTGGTGAT TTGAAGAAAC CTGTTTACAA 
5651 GGTAAGGGAC TGAAACAAAT AGGTGACAGG AAAAAGAGCA GCAGGGGTAC 
5701 GAGCTGGAGG AGTAAGTGGC TTGGCTTGCT CTCTTTCAGA ATGGAGGGCT 
5751 GTATGGAAAG GAGGGGTAGT GTTCTTGAAG AGTGTTGGGG TTTAAATCTA 
5801 GGGGGACCGT GTCTTGGCAT TGATTGAAAC TCCTGGCTTA ACATCACCCC 
5851 GAAACTGTTA GTTGGACTGA ACATGACATT TGGCAGTGCA GTTAAAAACA 
5901 CTTCCTGCTG TAGCCTGGTA ATGGTCAGGC TATGTGAAGA GCTGCTCTGG 
5951 AGCTCAGTCC AGAGCGGGTA TTCTGTTTCT TTCACTCTGA AATCCTGCCT 
6001 CTCGATATTT TGAGAAGGAA GGAGTTGGTG AATTGTTTTA AAATCCTCGA 
6051 TGAATGTCTT CATTTATTCA TGACACCACT TCTGAATATA TTTATGTGCC 
6101 AGACGCTGAA GTTTACTAAT ATTATGGTGC CCAGTAAATA CTTGTTTTTA 
6151 CTAATATTTT TTATGGCAAT AAAATGACTT TTTCAGGATT ATGTGATTTA 
6201 AAAGATTGAC CCTTTTGGCA AAATACGTAT TCATGATAGG AAATATATAC
6251 AACATAGTTC ACTTAAACCT CCCACCAGAG CCCAGGGTTC ACTGTTACCA 
6301 TTCTGAAGTG ACTGGAATTT CCTAGAAGTG GATATGCCAT ATTTTTTTAA 
6351 CCACTCCTAT TGGATATTTG TTTTTTATTT TTTTGAGATG GGGTCCCACT 
6401 CTGCAGTGTA CAATATCATA GTTCACTGTA ACGTGTATCT CTTGGGCTCA
6451 AGCGATCCTC CCCACCTCAG CCTCCCTGAG TAGCTAGTCT TCAGTAGCTA
6501 GACTATAGGT GGGCGCCACC ACAGCTGGCT TTTTAAAAAA TTTTTTATGA 
6551 ACACGAGGTC TCACTATGTT GCCCAGGCTG CCCTCAAACT CCTGGGCTCA 
6601 AGTGATTCTC CCACCTTGGC CTTCCGAAGT GCAGGGATTA TAGGCGTGCG 
6651 CCACTGCACC CGGCCCTGTT GGATAAATGA TTCCAGTCTC TCCCAAAAAG
6701 AACTGTTGTA AGACTGTGGG GTGAGGGGAG GGAAGGGACA AATAGGAACC 
6751 CGCCGTATTT TCCACTCCCT GTGGGCCTAA AACTGCTCTA AAAAATAGTC 
6801 CATGAAAAAA TACATAGTAC AAACAGCAAC TCTTTCTGAT ATGCTTGCAT 
6851 TTAAAATCAG GCTTTTTCTC CCTTTTGGAA AAACACAGTC CTTGTTTGCT 
6901 TTAGGGAAGA GTAAAGGTCA GTGCGCTGCA TTGCATTAAT TTCGAAGGGA 
6951 AAGATGAGAA GACATCTTGA AAGGAATGGC TGGCTTTCTA GAGAATAGTA 
7001 GAGGCTTAAT AGGTGTCATA GAAAAACCAG GGTTGGACAG TGGTAGTAAA 
7051 ACGGCAAAAC AGATTTTATT CAGAAAAACT ACTGCAGTAA GAGGAGAGAG 
7101 ACCTCGGTAC AGAACTGCTC CACTGCGAAT ACAAAGAAAA GTAGGAATTG 
7151 ATGGCGGGGG AGCCGGATGT CAGTGGATGG AAAATTATTA CGAGGAAACA 
7201 CAGGGGTGTG CATTCTTGCT GAAGGCAGGC CAGAGTTATC AGACATCACC
7251 TGAGGGATGG AGGGGGATGT GGAACCTAAT CGGCTGTCTA GGGTGATCAG 
7301 ATACTGAAGT TGGGGGATTC TGGTCAAATC AATTTAGCAG GATTCTTGGT 
7351 AAAACTGGGC GATGCAAAGA CAGATGCGTT GAGTACAAAG TCCAGGCTTT 
7401 ATTGGGAAGA GGATTTCAGC GGAGCCCGAG TAGAGTTTGG TCTAGGGAGA 
7451 CTCTGTCACT GGGAGGACGA GCGAGCCGCT CGGAAGTGCG CTGGGTTCTC
7501 TTAGCGGCCA GTGGGTTCTG GTGAGAAGGG CAACAGCGGG AGGAGGCGCC 
7551 GGTGCGGAGC GGGAGGCCGG GGGCGGGGCT GCGGGGCTGC GGGGCGGGCC 
7601 CGTTGTGGGT CGGCCCAGCG CGTATTCGAG TAGAGGGCGA GCCCGTCCCG 
7651 CCTCTCGTCG GGCGCTTCCC AGATCTGCTT GAGTCTATGG AGGAAAAACT
7701 CCGCGGGGTC CGCGATTCCC ATGGCCGCAG CCGCCTGCGG CACCAAGGCC 
7751 ATGGCCCTCT TCAAGCGCAC CTTGGTGCTG AGTCCCGCCG CGGCGCCCAG 
7801 GGGCCCGGGC GCAGGCACCG CCCCGCGGGG CTGCTGCTTG CCTCCTGCCG 
7851 CCTGGCCCTG CAAGGACTGG CCTCGGGGAG AGGGCGGCAG GCTGTGGAGC 
7901 CGCCTGCCCC AGTCCCAGTC CCACTCCCAC TCCCACTCCC ACTCCCACTC
7951 CTGCTCCTCG ACGTCTCCCA CCGCCGTGTG TGTTGTCTGC CCGCAGGACT
8001 CGCTCAGCAG CAGCCTGAAA ACTTGCTACA AGTATCTCAA TCAGACCAGT 
8051 CGCAGTTTCG CAGCTGTTAT CCAGGCGCTG GATGGGGAAA TGCGGTGAGT 
8101 GATGGAGGCA GCGCCTCTGG CTTGGAGGAA AGCTTGTCCG GGACTTTTGA 
8151 GTGTGTTGGA AGCTACCTTT TGATATAGCG CTCAGCGTTG CAGCCTCGTT 
8201 GCTGTGGCTT ATCCAGAACA TAGCCCGGCC CTACGTGTTT ACTTTAGAAA
8251 GCCCTTCCAG GCTCTTTGCC ATCTAGTAGA GTCCCTGCGG GCCCAGCCTT 
8301 TCAGAGAAGG GGGGGGAGGG GGTGATGTTT ATTAACTTTT TTTAGTCTTG 
8351 GCAGCTGAAC CTGCCTGTGA GCAGGTCGTG TATTTCTCGG CTTCCCTTAT 
8401 CCAACTTTGC ATTTCTATTT CTAGCATATT GGGTTGATTC TTTTGAAGCT
8451 GCCTCTGTGC ACATTACACC CATGAACTTA GACCAGTTGC CTTTATGTAT
8501 GATCGTATTT ATACTGAGAA GTTACTGTGT TTTTTGACTT TCTTTTCTAT 
8551 TTGCTACATA TTAGTTCGGT CTAAACGTTT GGTCTTCTGG TCTCCATAGT 
8601 TCTACATTGG TTAAATGCAA CTCACTTCTG GGAGTAGTGG TGACATTCAA 
8651 CTAGTAGGCT TTTTAATAAA CTACAGAAGT TCATTACTCT CATGTAAGGA 
8701 AGGAAAACTA ATGTAACTTT CGTTAAGTAT GAAAAGCGTT GGATATCCTT 
8751 ATAGTTCTTT AGAGTTAAGG GTGAGATGGG TTTAGAAAGT GGCCAGGCAC 
8801 AAGTTATTTT AAAATAAAAA ATCTTTGGCT GTTTGTTCCA ATATATTAAT 
8851 AGTTTTCCCT TTTTTACAGC AACGCAGTGT GCATATTTTA TCTGGTTCTC 
8901 CGAGCTCTGG ACACACTGGA AGATGACATG ACCATCAGTG TGGAAAAGAA
8951 GGTCCCGCTG TTACACAACT TTCACTCTTT CCTTTACCAA CCAGACTGGC 
9001 GGTTCATGGA GAGCAAGGAG AAGGATCGCC AGGTGCTGGA GGACTTCCCA 
9051 ACGGTGAGTG GGGTTACGCA TCTTGTCTAC GGACTGTTGT GTTCATAATT 
9101 GCTAACGTGG TTGTCCGGTA GCCTCCATAC ATGTGGAGAA AGGTTAAATA 
9151 AGCATTCTGA GGGCAGCATA ATGTGAGGGT TAAAAACTCC GGTAGCCAAG 
9201 ACTCTGAAGC CAGGCTGCCT GGGTTGGAAT CTCAAATCTC CCACTTACTA 
9251 AACTGTTGGT TACTTACAAA GACTCTCTGT GCCTCAGTTT CTTCATCTGT 
9301 AAAATAGGGG TAATAATAAC ACCTACCTCA TGGTATTCTG AGGATTCAAA
9351 GAATTAACGT AGGTAATGCT CTTAGAATGT TAGCTACTGC TGTTATTATC 
9401 AGTATTGGAA GTCCAGTGTT TCTTCCTGTG GGAAGACGCA GTCAAATTTT
9451 AGTGTTGTGA AAGATTCTCA GGCTAGCTCA CAAAAGCCTG CCGACTGTAT
9501 GATGCAGCCT ACCTGTAACA CTGCTGGCCT CTTGACTACC CGGAGCCTGG 
9551 TAGCATGGGA CTGCTGCTCA CGATGGGCAG CAGCCTGGCA TGGGGGCGGT 
9601 GTCTGTTGGC AGCTAGGGCG AGCCTCTGCC ACTTCACCTG TGATCCTGGG 
9651 CAAGTTCCTT ATCTGCTTTG TGTCTCCGTC TCCTCGTTTG TAAAGTTAGA 
9701 GCTGAGAGGA TTAATTTCGC ACATATAAAG TACTTAGTGC CTGGTACAGG
9751 GTAAGTATTC TGTAAGTATT AGCTATTTGG TCTATTTTGT TGGAGTAAAG 
9801 TGGGTTATAG TTAAAATCCT AAGATTTTTA AAGTCCCTCA AGTTCACGTG 
9851 GACATCTGCC TAGGTCCTAC TATCCTAGAA TTCGCATGTC TTATCACACA
9901 AATAACTGAT TCTTCCATAT CTTATAAATA AAGGTTTGAT TTAGCAAAGT 
9951 CACATGTTGT GTAATAGCTC GAAGAAGCCC TTTTTGTCCA CAGTTGCCAG 
10001 AGCTTTTGGA GAACAGTCCT TATGTTATTG AAACAAACCT AATCTGTAGC 
10051 TGAGTTGGGA GGGAGCTAAG TGGACAGAGA GTCCTCCACC CAAACAAAAG 
10101 AATCTTTGAT TCTTGGGCAT AATGGGAGCA ATATTTAAAA AAAAAAAAAA 
10151 AAAAAAAAAA GGAATGTTTG GGGAAGACTC TTGCGGTGCA AAGGCTGTTT 
10201 CAGATTGCTG AGATCAGACC TTAAGTACCA AAGCCCAAAT ATAGTACAAC 
10251 ATAATACAAA TGAGAAGAAA ATAGCTGAAG AATAATTCGA GTTTATACAG 
10301 TACAATTCAA GAGAAGAAAG AAAATTTATG ACGACTAGCT GGGTGAGAAT 
10351 TAGAACTGTA ACCCTGGGAA GGTCCTGGTG ATTTGACTCT CACAGGACAC 
10401 CTGATGACCA GAGGATGGGT TTCCTTTGAT GGGAAATCTG TGGCGATTCA
10451 TTGATGGGCC TCTGAATTCT GCTGAAGCAG AGGAAGTAGT AATACCCCAT 
10501 TTATAATGGA AGTGCATTCT CACTTAAAAA CAACTAATAT TATTCTAGCT 
10551 GGACCTAGCC TCTAGAAACA GCCAAATTAC ATTTGACTTG AGTGGATTCA 
10601 TAATAATTAA AAAATTTCTG GGGCATGGGA TAAATGTGTT AGGTATTGCT 
10651 AAGTCAAGGC AGCCCTATCC CCTCAGCAGA AGTGAGGGAA TATGAAAGTG 
10701 TGTGAATGCT AACATAATTT TGGGGAATAT CGCCGTCAGA TTTCCAGATG 
10751 ATATTCCAAC ATGTTTGTGA AACTTCAGTG TCTTCCTGTG TTCATACAGT
10801 GTTCCAGTGG AAAAATAATG CTTAGTTCTG GAAGGTTTCA GATGTGAACA 
10851 CTGAACTCAT CGTTTTCTTT TTTGGGTAGT AGAGTTAGAG ATTCCATCCT 
10901 CTTGAAAGCA CAGTTGCCCC GGGAAGAGTA AAAGGGAGCA GAAGGCGTAA 
10951 GCCAGGCACG GCTGTTTTCA CTGTTGTTCA CCTTTTGTAT CCTTACGAAT 
11001 ATGAAGATGT ACTAAGTTGT GTGTTTTGCG TGCATATATA ATTTTAAGCT 
11051 ACTTGAGTTG TAGGTCCCTC CAGTCTGTGA TTCAGTTTGA GATGGGACTG 
11101 TATGGGAATT AACAGTGCCT TGTCTTCTTA AGCAGTGATT TGTGTATGTG 
11151 CTGATATAGC TCAGTATGTC TTTGAAACCA GTTGTCTGGG GCTAGGCCTG 
11201 CAATCAGCTT TTGGCTAAGA GGTCCCAGGA TGGAACAAGT AGTGTGAAAG 
11251 AGGACTGATA CCTTGGCCTC ACACACAGTA CTGCTCTTAG ACTGGGGCAA 
11301 GTGAAACTCC TCACTTCAGA GTGCCCCATT CTAGGCCCCC TCACTCCCAA 
11351 AGGGGTGAGG GATCACTGGG GCCATGGGAA TGTGCTTGTT CAGCTCTCGT 
11401 GGGCTCTCCT TCTGTACCAC GTTCTGGACA TCTGGAGTTC CTTGCCCCAA 
11451 ATCCCTGAGC CCACGTCTGC GTCCGCACAG TCTATTTCCT AAGGTCAGTC 
11501 CATCTCCTCC AGGTGGGAAC GTGCCACCAT TGACTGTGCC CTTGGGCCTG 
11551 AGTGATGGCC AAGGGCTGTG TTGGGGAGTG TTGTGGATGG ATCCTGGCAC 
11601 CGAGGGCTGG GATATCCTCT CAAATGAATG TGAGGTGCCT CCCAGTGCTG 
11651 GAGAGAGCGG GATTCAGGAA GCAGTGGAAG GGAAGAGCCT GGGATATGGG 
11701 GATCAGCTGT CTGTGCCCTG CTGCATTCTG GAATAAAACT CTGAGGGACT 
11751 AAGAATTCTA AATTCAAACC TGAATCAACC AGGTTGTTAC AAAGATAAGT 
11801 TTGTCAGTGC AGGAGGATAC AATATATTTT ACTTAAGTTA CTAGCTCGAT 
11851 TGATCATTTT TAAATTTTTA GCTACATATA GTATGTGGGC CTCCATTTGT 
11901 CCTCTTATCC CAGGCCTTGC AGAATTTAGG AATAAGCCTC AATACAGTGT 
11951 TCTAACCCAG TGACTTCCGC CTCGATGTAC AGTAGATTGA ACCTGATCCT 
12001 TTATACTTTA GTGATCATTA GTTGATACCA GTTCAAGTCA GGCTTTCTAG 
12051 AAATCTCATT GTATGTTAGG GGTTCGATTA GAGTACAGTC ATGCATCACT 
12101 TAATGAATGG CCACAGGATA CATTCTGAGA AACGCATTGA TAGATGATTT 
12151 CATCATTCTG TGAACATCAT AGAGTGTACT TACACATACC AAGATGGCAT
12201 AGCTACTACA GACGTAGGCT CTGTGGTACA GGCCATTGCT CCAAGGCTGC
12251 ACATCTCTAC AGGATGGTAC TGTACTGAAT ACTGTAGGCA ATTGGAGCAC 
12301 AGTGGTAAGT ATTTGTGTAT TTAAACATAG AAAAGGTATA GTAAAAACAG 
12351 GGTGTTACAG TCTTAAGGGC CCACCATTGT ATTTCCAGTC TCCGTTGACT 
12401 GAAACATCAT TATACAGTAC ATGAGCACGT ATCTTTCTCA CCTGGTACTA
12451 GTGGAAAGCT AGAAGGCTTA GAAGTCTACC TGTAAACATA GCTTAAGTAA
12501 TAATACAGCC TTATTTTTAA ATGATAATAG CAATAATAGT GTTCACTTAT 
12551 TGAGCATTTT ACTATGAGTT ACTTACTAAA TATATTTCAT CGTTAATTTA
12601 CTCTTTGTGT TATTTGATCT ATAACATCGT TTAACAGGGA AATTACCTAG
12651 TACATAATGT ACTGTTATCT ACATTTTATC TAGATGAGGA AACTGAGGCA
12701 CAGAGAAATT AAGTACTTTG CCTAGGATTA CCCGTGAAGT TAAGTGACAG 
12751 AATCAATGAA TCTGGAAGGT CTGGCTTCAG ATCTCTTGTG CTGAGTCACT 
12801 CGCATACTTT ACTACCTCTA AGGTTTCTAA TCAGAGGAAT TTGTATCTGT 
12851 ATTCCCTGCT ACTCTTACCC TCTATGTGGG ATTTGGCCTT TCTCCATTAT 
12901 CCCTGTGAAC TCGCTCTGGG ACCTTCCTTC TTGTACTTGG AACCATCAGA
12951 AAGTGATCTG AGAACATAGA AATCTACTGT GTTGTGAAAC AGAATTACCT
13001 GGAAGCGGAA AAAGCCCTCC TGGCTCAATT CACATGTCAC GGCTTATGGT 
13051 CGTATCCGGG GAACATATGA AACTGGGCAC TGAGTGCGGA GTCAGGAAAG 
13101 CCCTGTCCAT CCTCTGGGTT TCTGGGGAAA ACGTGGACCC CTTCATTGTC 
13151 ACTTTCTCCT GTATATTTTT GTTTTTACTT TTAGAACTGT ACAATTACGT 
13201 AATAAATAAT AAAAAGTCGT TGGAAGGATA GGTGAAGTTC AGAAGTGAAA 
13251 GTGTTTTGGA GGAGTCTAAG CTCCTTCCCA CCCTCATTGA CCTTTCCTCT 
13301 CTAATAAATA GAACTGGTCT AACCAAGGAT CTGTGGAATG AGCAGAGTCC
13351 AACGGAGATT CAGGGATTCT AATAACCTCT TGTAGAATCA CTGGTTTGTT 
13401 TCAGCCACAA GAAGGAATTA CCTTTTGACA TTGGCTTGAA CAGCTGTTGT 
13451 GCAAAGAAAA ACTTTTTGGA AAGTTCTGGA AGTACCAGAT TGATTTTATA 
13501 GGTTTTTTTT TTTTTTTTTG GAGGGACATG GGGGTATTGA CAGTTGATGT 
13551 TAATCAGAAA TCCTAAATTA TGTGTATTCC TGGTATGTTG CAATCAGCCG 
13601 GCCACCTGGT TTTCCTCTGG GCTCTTAATT TTAGGTGTAT TCCGAGGAAG 
13651 TTTTTCTAAC TTTTCTGTAA ACACAGACCA GGTATATTGC ATACTTTCAA 
13701 TGTTTAACCA AATCTCTTCA CTGTTTGCAG TATTATCTGT AGGCTCTCAT 
13751 GTTTTAAGAC TTCCCCATGG TGTTTTTGTA TTGTATTTTG CTAACCTATA 
13801 AACAATTCTT TGAACTTAAA ACAAGATATT TGGGCAGTAA CAATAAATTT 
13851 TAAAAACATC AATTCAACTT TTTTACATTA GGGCTTGGAC TATGGAAAAA 

13901 GTATTGGGCA GCATGCCTCA TACTGAGTTG TTTAATGAAT TTAAAAGTAT
13951 AGCCNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

14301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
14951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
15001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
15451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

18601 GACCTGCCTT GCTAGGTTGG GGAAGTTCTC CTGGATAATA TACTGAAGAG
18651 TGTTTTGTAA CTTGGTTCCA TTCTGTCTAT CACTTTCAGG TACAACAATC 
18701 ATAGCATTGG TCTTTTCACA TAGTCGCATA TTTATTGAAG CCTTTGTTCA 
18751 TTTCTTTTCA TTCTTTTTTC TCTAATCTTG TCTTCTTGCT TTATTTCATT 
18801 AATTTGATCT TCGATCACTG ATATCCTTTC TTCTGCTTGA TCGAATCGGC 
18851 TATTGAAGCT TGTTTATGCT TTGTGAAATT CTTGTACTTT GGTTTTCAGC 
18901 TCCATCAGGT CATTTAAGCT CTTCTCTACA CTGGTTATTC TAGTTAGCCA 
18951 TTTGTCCAAC CTTTTCTCAA GGTTTTAAGT TTCCTTGCGA TGGGTCAGAA 
19001 CGTGCTGCTT TAGCTTGGAG AAGTTTGTTA TTACCAACCT TCTGAAGCCT 
19051 ACTTCTGTCA ACTCGTTAAA CTCATTGTCC ATCCAGTTTT GTTCCTTTGC 
19101 TGGTGAGGAG TTACGTTCCT TTGGAGGAGA AGAGGCGTTC TGTTTTTGGA 
19151 ATTTTCAGCC TTTCTGCTGT GGTTTCTCCC CATCTTTGTG GTTTTATCTA 
19201 CCTTTGGTCT TTGATTTTGG TGACGTACAG ATGGGTTTTG GTGTGGGTGT 
19251 CCTTTTTGTT GATATTGATC CTATTCCTTT GTTTGTTAGT TTTCCTTCTA
19301 ACAGAGGCCC GTCAGCTGCA GGTCTGTTGG AGTTGCTGGA GGTCCACTCT 
19351 AGACCCTGTT TACCTGGGTA TCACCAGTGG AGGCTGCAGA ACAGCAAATA 
19401 TCGCGGCCTG ATCCTTCCTC TGGAAGCTTC GTCCAAGAAG GACACCCACC 
19451 TATATGAGGT GTCTGTCGGC CCCTACTGGG AGGTGTCTCC TCCCAGTCAG 
19501 GCTACATGGG GCTCAGGGAC CCACTTGAGG AGGCAGTCTG TCCGTTACTG 
19551 GAGTTCAAAT GCCGAGCTGG GAGAACCACT GCTCTCTTCA GAGCTGTCAG 
19601 GCAGGGATGT TTAAATCTGC AGAAGCCGTC TGCTGCCTTT TGTTTAGATA 
19651 TGCCCTGCCC CCAGAGATGC AATCTAGAGA GGCAGTAGGC CTTGCGGTGG 
19701 GCTCCACCCA GTTCAAGCTT CCTTGCTGCT TTGTTTACAC TGTGAGCATA 
19751 GAAGTGCGTA CTGAAGCCTC AGCAATGGCG GGGAGGCGCT TCCCCTCACC 
19801 AAGCTCCAGC ATCCCAGCTT GATCTCAGAC TGCTTGGCTA GCAGCAAGCA 
19851 AGGTTCCATG GGCATGGGAC CCCCCGAGCC AGGCACTGGA GGCAATCACC 
19901 TGCTCTGCCA GTTGCGAAGA CTGGGAAAAG CACAGTATTT GGGCAGAGTA 
19951 TACTGTTCCT CCAGGTACAG TCACTCACGC CTTTCCTTGG CTAGGAAAGG 
20001 GAAATCCCCT GACCCCTTGC ACTTCCTGGA TGAGGTGACG TCCTGCCCTG 
20051 CTTTGGCTCA CCCTCCATGG GCTGCACCCA CTGTCCAACC AGTGCCAATG 
20101 AGATGAACCA GGTACCTCAG TTGGAAATGC AGAAATCACC CATCTTCTGC 
20151 ATCGATCTTG CTGGGAGCTG TAGACCAGAG CTGTTCCTAC TGGGGCATCT 
20201 TGGAAGCAAC TCTGGGTCTG AGTTTCTGTT TGTTGCCCTG ATGTATATCC 
20251 CCAGTGCCTA GAATGATACT TGTTACATAG GAAGTGCTTG ATCCATGTTT 
20301 GCACAAATGA ATCTTTCTCA TAATGAGGTT TCTCTAAACA AGCTGTTCTC 
20351 CCAAAAACTT ACACCCAGCT TTATGTTGAA GCATCTCATT ATACATTGGA 
20401 AAGATGAAAT GTGTAGTGAG ACTTTGAATC TTCTTTTGAA TCTAGAAACA
20451 TTAGCATTTT TAGACCATTC TATTTTAATA TTTATGAAAT TTATGAAATA
20501 ATAAGAAACA TGAGGCCGGG CTCAGTGGCT TATGCCTGTA ATCCCAGCAG 
20551 TTTGGGAGGC CAGGGCTAGT GGATCATGAG GTCAGGAATT TGAGACCAGC 
20601 TTGGCCAACA TGGTGAAACC CCACTTCTAC TAAAAATATA AAAATTAGCT 
20651 GGGCGTGGTG GTGCATGCCT GTAATGCCAG CTCCTGGAGA GGCTGAGGCA 
20701 GGAGAATCAT TTGAACCTGG GAGGCGGAGT TTGCAGTGAG CTGAGATCGT 
20751 GCCATTGCAC TCCAGCCTGG GCAACATTGC GAGACTCCAT CTCAAAAACA 
20801 AAAACAAAAA CAAAAAAAAT GTGTGACCTA AATTAGGCTT ATAGATGAAC 
20851 CATTGCAGTC ATGATTAATT CCGCCATTGT TTGCCTTGTG ATCTTTGGTG
20901 CCATGTCTGT ACATATTTCA TGATTTCTGT GTTTTTACGG TTTCCATTTC 
20951 AGATCTCCCT TGAGTTTAGA AATCTGGCTG AGAAATACCA AACAGTGATT 
21001 GCCGACATTT GCCGGAGAAT GGGCATTGGG ATGGCAGAGT TTTTGGATAA 
21051 GCATGTGACC TCTGAACAGG AGTGGGACAA GGTTAGTCTC ATAAAACAGT 
21101 GTCTGTGTGT GATGTATTAG ACAGAGCTGG CAGTCCTCAT AGTGAAGCTC 
21151 AGAACAAGAA AAGTTGTCCA GTATTTTCAG CCCCTCTGGT TTTACAATTC 
21201 ATCTGTTTAG GTTGAATGTC TCATCATAAA CAGTTTATTC CAGAGTTAAT
21251 TCCAAACCAG CAGCTATGTA GGATATCAGC CAGGCTAGGA GTAGGGTACT 
21301 GGAGAGAAGT GCTTATCTAG ACAAAGGGAT GTAATTGACC ATGAAGATTA 
21351 AAACTACACA TCAAAACATA AGGTAGGGTT AGGAGTCTTG CCTATTTTTC 
21401 ATAGGAATGG TGTTTGTGAG ACTTACTCAT CACTTCTGTG GAAGTAAAGA
21451 CATTTTATTT ATTTATTTTA AAGCCAGTCA GATTTAGCAG GCAGAGACAT
21501 TTCAGACATC TAAAGTGTTG ATGTATTTCA TACCTTTAAC TGTGCTTAAA 
21551 TTAGGATCTC CGAAAAGATG CTGCTACATG GTCACTACGT TAGTGTAGGT 
21601 CCAAGGTCTT GGGCCTCTTA ATTTTTCAAA CCTCAAAACT TGACAGCAGT 
21651 TATCTTTGGA ACTGCTGATT TGTGCTTCCT AAGTTAACAG CATACAATGA 
21701 CTGCTAGAAA TCAATTTCTG CATTTAAGGT GAAGTTAGCC GGGTACTATG
21751 GTTTACCTGT AATCTCAGCA CTTTGGGAGG CTGAGGTGGG AGGATCATTT

21801 GAGCCCAGGA GTTAGACACA AGCCTAAGCA ACATAGCGAG ACCCCGTCTT 

21851 TCAAAAAATT AAAAAATGAG GAGGGAATTG GTGGCATGTG CCTGTGGTCC 

21901 CCAGCTACTC TGGAGGCTGA GGTGTGGGAG GATTGCTTGA GCCCAAGAGT 

21951 TGAAGGTTGC AGTGAGCCAT GATTGTGCCA CTGCACTCCA ACGTGGGTGA 

22001 CAGAGCAAGA CACCTACTGA AAGAAAATAA AGTTGAAGTT AAAACTTCTG 

22051 GCCAAGAACC AGCACTGGTT ATGATAGTAA CTCATTTTCT GTTGTGCAGA 

22101 TTTATTCAGG AAACTTAATT TTAGGTTGTT GAATAGAAGT TTTGATCAGA 

22151 TAAAATTGAA TTAAAAAAAA TTTTTTTTGA GACAGGGTCT TGCTGTTATC 

22201 CAGGCTGGTG TGTAGTGGTG TGATCACGGC TCCCCGCAGC CTGAACCTCC 

22251 TGGGCTCAGG TGATCCTCCC ACCTCAGCCT ACCGAGTAGC TGTAACTACA

22301 GTGCATGACA CCATACCAGG CTCATTTTTG TACATTTTTT GTAGAGAGAG 

22351 GGTTTTGCCA TGTTGCCCAG GCTAGTCTCA AACTCCTGGC ATCAAACAGT

22401 CCTCCCACTC TGGCCTCTCA AATGTTGGGA TTACAGGCAT GACCAGCCAA 

22451 TTATTTCAAG GAGTTATTTT TTTTCTTCTA CTTTGGGGGA AGATGAATTA 

22501 TATAAGTCTC CATTTTAGGA GTATTTCTAC CAAAAGAACT ATTATCTTCA 

22551 AATATATTTT TGGATAGTAC TATAGATATA CTAATTTTTT TTTAAATTTC 

22601 TAGTAATTCT TTTGAAGATT TTGTATAGCT GTCCAAAGCC AATTTCTGTC

22651 TACCTAATTT CAGCAAGATT TCACTCTTTT CATGTTACTT TTGTCCCAGA

22701 ACAAATTTCA AGTGCTTTCT CTTCACCTGT GCATTCTTCC CCCTGATTAG 

22751 TCTCTGGCTT TGTATTACTT TCAGTCAGAG ACGACTTTTT TTTTTTGAGA 

22801 CAGGGTCTCA CTCTGTCACC CAGACTGGAA TGCAGTGGCA CAGACAAGGC 

22851 AGCCTTGACC TTCTGGGCTC AAGCAATCTT CCTTGCCCTC AGCCTCCTGA 

22901 GTAACTGGGA CCACAGGCAC GTTGCCACCA TGCCTGGCTA ATTTATTTTA

22951 ATTTTTATTA TTTTTGAGAC AGGGTATTGC TCTGTCACCC AGGCTGGAGT

23001 GTAGTGGCAT GATCAAGGCT CACTGCAGCC TTCACCTCCT GTGCTCAAGC 

23051 AGTCCTCTCA CCTCAGCCTC CCCATTAGCT GGGACTATAG GTCCACACCA 

23101 CTACACCAGG CTAATTTTTG TAATTTTTTG GTAGAGACAG GGTTTCATCG 

23151 TGTTGCCTAG GCTGGTCTTG AGCTCCTGGG CTCAAGCGAT TCACCTGCCT 

23201 TAGCCTCCCA GGTGTGAGCC ACTACACTCA GCCTTTTAAA ATTTTTTACA 

23251 GAGATGAGGT CTTGCTTTGT TGGCCAGGCT GGTCTAAAAC TCTTGGGCTC 

23301 AAGCAGTCCC CTCTCCACAG CCTCCCAAAA TTCCGGGATT ACAGGCGTGA 

23351 ACTTCGGTCA TTTCCTAACT TTTACCCTTC CTAATGACAC TCCAGAGCTT 

23401 ACCTTCTTTA CTTTTGCTTC TTAAGTTAAC TAATAGACAA TTATTGTATG 

23451 TGGATATTGC ATTAAGTTGT CTTAGGATAC CCTTTTCAGA GGAGGACAGC 

23501 TTTTGACAAA TTGCTGTCGC GGAAAAAAAA AGTATTTGGC AATTAAGAGT 

23551 TGCATTTACT GAAATCTCTG TTGAGAGAGG GGAAGTTACG TTGTCTCTAA 

23601 AAGAAAAACT AAAAAGAAAA GGGGAAGTTT TAGCAAAGTT GTTAAAGCCT 

23651 GACACTTAAG TCATACTACC TAGTTTTGAA CTCTTAGCCC CTGCCACAGA 

23701 CACGGCAGCC CCTTGAACCT TCCTGGGTTC AAGCGAGCCT CCTACTTCAG 

23751 CCCCCTGAGT AACTGGGACC ACTGGCCTGT GTCACTGTGC CTGGCTAATT 

23801 TTTTTTTTTT CCTCACATGG GCAATGTTGG GCAAGTTAAA TCGACTTCTT 

23851 TGTGCCTCAG TTTCCTCATC TGAAATGGAG ATCATACTGC TATGTACCTG 

23901 ATACAATGTT TGTGAGGATT GAATGTGCAG AGTTCTTTTT TTCTGTTGTT 

23951 GTTGTTTTGA GACGGAGTCT CACTCTGNNN NNNNNNNNNN NNNNNNNNNN 

24001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

24051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA TCTCGTGATC

24101 CGCCCGTCTC AGCTTCCCAA AGTGCTGGGA TTACAGGCAT GAGCCATCGT 

24151 GCCCGGCTGA ATGTGCAGAG TTCTTAAAAC CGTGTCAAGA ACATAAAATA 

24201 GTTATTTGTT CTTTCATATA ATGATGATTT TGAGGGCCTG CGGATCTTGA 

24251 CATGTTATCA GATTGGTCAA AAAAAGATTA AACCATAGTT GGTATTGTCC 

24301 TAGTTCCTGT TACCAGAATA TTCCATCTTT CATCGTTGCC TTCTCTCATA 

24351 GTTTTATGTA TCAAAAAGTT TATTGTAAAG CTAGGCCGGG CACGGTGTCT 

24401 TGGGCTGGTA ATCCCAGCAC TTTGGGAGGC CAAGGCTGGC AGATCAGTTG

24451 AGGTCAGGAG TTCGAGACCA GCGTGGCCAA CATGGTGAAA CCCCGTCTCT

24501 ACTAAAAATA AAAAATTAGC TGGATGTGGT GGTGGGTGCT TTAATTCCAG 

24551 CTACTCAGGA AGCTGAGGCA GGAGAATCAC TTGAACCCAA GAGGCAGAGG 

24601 TTGCAGTGAG TTGAGATTGT GCCACTGCAC TCCAGCCCAG GGGACAAAGT

24651 GAGACTTGAT CTCAAAAAAA AAAAAAAAAA AAAGTTATTG TAAAGCTAGA

24701 CACGGTGGTA TTTGCCTACA ATCCCAGCTG TTCGGGAAGC TGAGGCAGAA

24751 AGATTGCTTG GGTCCAGTAG TTTGAGTCTA ACGTGGGCAA ATATATGAGA
24801 CTCCATCTCA AAAAAAAAAA TAAAAAATAA AAATAAAAAA ATGTTTACTA
24851 GTTTTTTTCA GTAGCCTTTT ATTATAGTAG CAGTACATGT GTATTGTAGA
24901 AATTTGGAAA ATACAAGTGA AAAATAAAAA CATCAAATTC CCGTCAGCCA
24951 GAGACTGCTG TGAAATGTTT TGAGCACATC CTTCTTGAAT GTTTTTTAAA
25001 TCCTGGTATG TATATTTGTA TTTTAAAATC AAAATGCATT CTTACCCATT 
25051 CTCTTTTGAA CCTGCTTTTT TGTAGCTAAT GATCTCTAGT GTGTCCATTT 
25101 CAGTAAAAAT TCCATTATTA AAGTGCTTTA AAAATCGTCT CTTACAGTAC 
25151 TGCCACTATG TTGCTGGGCT GGTCGGAATT GGCCTTTCCC GTCTTTTCTC 
25201 AGCCTCAGAG TTTGAAGACC CCTTAGTTGG TGAAGATACA GAACGTGCCA 
25251 ACTCTATGGG CCTGTTTCTG CAGAAAACAA ACATCATCCG TGACTATCTG 
25301 GAAGACCAGC AAGGAGGAAG AGAGTTCTGG CCTCAAGAGG TAACAGATTC 
25351 AGGGTATTTT GGGGGAAAAT AACTTTAGAC ATTCTCTGAA AAATCCTTTA 
25401 ACTCTTGTGG TTGCGGGTGA CAGAAAAACA AGCCAGGCCT CCCCCAGGCA
25451 GCATAAGGGG ATGTGGAAAA TAGGATAGAT TGACATGAGT TTGCTTCAGG
25501 TAGACTGGCT GACTCCCAGG ATTCACACCA CGTAATCAGT ATATTCAAGC 
25551 CTTGCTGTCC TTGATTTCTT TCAGACGGTC TTTCTCCAAG TGGTGGATAT 
25601 GGTAACAACC CACGTGCACT AGCTTAACAA AAAGTTCTTA GGAATGGCTT 
25651 TGTTCGGCCT GGCGCAGTGG CTCATGCCTG TAATCCCAAC AGTTTGAGAG 
25701 GCCAAGGTGG GCGGATCACC TGAGGCCAGG AGTTCGAGAC CAGCCTGGCC 
25751 AACATAGTGA AACCCCGTGT TTACTAAAAA ATACAAAAAT TAGCCGGGCG 
25801 TGGTGGCAAG GGCTTGTAAT CCCAGCTACC TGGGAGGCTG AGGCAGGAGA 
25851 ATCGCTTGAA CCCAGGAAGC AGAGATTGCG GTGAGCTCAG ATTGTGCCAC 
25901 TGCACTCCAG CCTGGGCGAC AGAGTGAGAC TCCCTCTCAA AAGAAGAGGA 
25951 AGGGCTTGGT TCTTCTGCTC AGCCCTGAAT CAGTTACTGT TGCTACACAG 
26001 CTGAGTTCTC TGGCCTCACC TGGATTACGT CTACACAGTA CACACAGAAT
26051 GGATTTCCCC CAAAGAAAGA ATTCTGCGGC AGGAAGGGGA AAGGGATGGC
26101 AGGTAGACAA AAACTCCAGG TGTCTGTAAT AAGGGACAGG GTCGATCTTT
26151 AATTAAAACA TGGACAGGGA ACAGAAAGCT TTTGATACTG ATTTTGTTCA
26201 GAAGGAAAGT AGAAAATTTT ATGACTGTTC CCTGAATTTA TTCCAGCATT 
26251 TACCTTTTGC TTTCCATAAA AGTGTTTCCT GCAGCCAAGT ACTTTAAAGT
26301 TTTAAAAAGA CGGGTGAGGC TAAGTGTGGT GTCTCATACT TATAATCCCA 
26351 GTGCTGAGGC CAGGAGTTCA AGACCAGCCT GAGCAACACA GCAAGATACC 
26401 ATCTCTATAA AAAATTGTTA GAAAATGATT CTGCTGAAAG AGCAAAAATA 
26451 AAAATTAAAG AAAGTAGAAA AAATAAAACT AAATTTAAAA GATTAACTGG 
26501 GCATGTTGGC ATGCACCTGT ATTCCTAGGT ATTCGGGAGG CTAAGGCACA 
26551 AGGATCCCTT GAGCGCAGGA GCTCAAGGTT GGATTGAGTT GTAATCACAC 
26601 CACTGCACTC CAGCCTCGGT GGCACAATGA AACTGTCTCA AGAAAAAAAA
26651 AAAGTGACAG AGGGAAACAA TATTTGCAAT TCATAGAGCA GATACAGGGT
26701 TCATATTCCT AATATTAAAA AAAACTTCTA AAAGTTAAGA AAAAGGCCAA
26751 CTGCCCCACA GAAAAATGGG CAAGGAGATA AGAACAAGAT TGTTCACAGG
26801 AAGAGACACA CAGATGATTA TTAAAAATCT GAAAAGATGC TGAGTCTTAC
26851 TCCTAAGAAA AATTCACATT TAAACTACTC TGGGGGCTGG GCAAGGTGGC
26901 TCACGCCTGT AATCTCAACA CTGGGAGACC AAGGCAGGAA GATCACTGAA
26951 GCCAGGGTAT CGAGACCAGC CTGGACAACG TAGTGAGACC TTATCTCTTA
27001 AAACAAAACA AAACAAAACA AAACAAAAAA AACAGTAAAA ATTGGCCGGG
27051 CACAGTGACT CCTGCCTATA ATCCCAGCAC TTTGGGAAGC CCAGGTGAGT 
27101 GGATCACTTG AGGTCAGGTG TTTGAGAACA GCCTGGCCAA CATGGCAAAA 
27151 TTCCGTCTCT ACTAAAATTA CAAAAATTAG CCAAGTGTGG TGGCATACGC 
27201 TGGTAGGGCC AGCTACTTGG GAGGCTGATG TGAGACTCCA TTTAAAAAAA 
27251 AAAAATCAAA AATTAGCTGG GTATAGTGGC ACACCCCTAT AGTTCTCGCT 
27301 CCTTGGGAGG TTGAGGCAGG AGGATTGCCT GAGCCCAGGA GTTCAAGGCT 
27351 GCAGTGAACC ATGATCACAC CACTGCATTC TAGCAGCCTG GGAGACAGAG
27401 CAAAACCCTT GTCTCAAAAC AAACAAACAA CAACAAAAAC AAAAAACACT
27451 TCCCTCAGCT CAGACATGGC CTTTTAAGTT TCCTAGGTGA CTCGTGTGCA 
27501 GCCAGGGTTG AGAAACCACT CTTGTCTTAC CCCTCTTTTG CAGACACAGG 
27551 GCTCAGAGAA GGGAAGGGGA TTGTCTGGGG ATGTATAGTG AGGCAGTGGC
27601 TGCCTTGGAA GTGGAGTCTC AGTCTCCCGG CTCCTAGGCC AGCCCCTGAC
27651 CACTGTTCCA TTGTCTCCCA GACAGAACAT CAGCCACGGG CATGTGATGC
27701 ATGAGCGTGA GCCACACCAT CTTGCACACA CAGGAGCAGA GCCCTGCTCT
27751 TCTCATTCAC TTACTTTATC TGTAAAATAG CATCATTTCT ACCACACGGT 
27801 GGTGGTGTGA ATAAAATGAG ATGAACTTCT AGCATAGAGT GCTTAGTAAA 
27851 GGTTCTGGAC ATTTCGTAGT AGTTGAATCA TGCCAAATGT GGTCCTAGGT 
27901 GATTGGCTTC TTTTGCTAGC ATGTTTTCAG GGCTCCTCCA TGCTGGGGCA
27951 TTGCATCACT GCTTTATTCC TTTTTATCGC CTAGTATTAT TCCACTGTGT
28001 GGATAGACCA CATTTATCCA TTCATCAGTT GGAGGATATT TGGGTTCTTC 
28051 CCATTTTTTT TGGCTATGGT GAATAGTACT GTGTACATTT GCATATAAGG 
28101 TTTTGTGTAG ATGTGTGTTT TCCTTTTTCT TGGGTCTATG CTGAGAAGTG 
28151 GAATTGCTGG TTCATACAGC AGCTCGAACC TTGTGAGGAG CTGCCAGACG 
28201 CTTTTCCAAG GTCGCTCCAC CATTTTACAT TCCCGTCAGC AGTGTGAGAG 
28251 TCCCAGTTTC ACCAGCACTT GTTGTTATCT CTTTTTAACT GTATGTATAT 
28301 ATACTTAACA TTTTATTTAT AATAAATGTA CATAATAGAG AATTTGCCAT 
28351 TTTAACTATT TTTAAGTCTA TTATTCAGTG GCATTAAGTA CATTAATGAT 
28401 GTTATATAAC CATCAACACT ATGTTTCCAG AACTTTCGCT AGCTTCAGAG 
28451 AATCCTCTAA ATAATATCAT TAAAAATCAT CAAGCCGAAT CCCACTGTTA
28501 GAATTAAAGG TTTTATTTCA CTTTCAAGTT ATCAGGATCC AGGGAGGTGT 
28551 AATACACTTA GAGGATAGAC TCAGCTCATT TCCCAGCTAT GCCTTTCAGC 
28601 AGCATTCTTA CCAGAGTAGG AATATAATGT TAGTCATTAT TTAGAGGCCT 
28651 GGCCATCTTG AGAAGGTTTA CTGTTTAGTC TGCAGTACAA TTATAACTGT 
28701 TTTTGTATAT TGGGTTATTT TTTTCAGAAG TAGGCCAGTA GCTCTAACAG 
28751 GAGCCTCTTT AGCCTGAATT CGTCCAAGTA GTGCAGTGTT GCACTAGTTG 
28801 TCCCTCGGGA CATGCTCCCC AATACGTAAC TCACTTCCAG GTTGCAACTG 
28851 GACACTTACT GGTAGTCAGA AATAGCTATT GCATGGAGCT TAAAATGAAC
28901 TTGATCTTCG TGAAAGATGA GTCTGCAGCT AAGAGACTTT ACTGTATATC
28951 ATAGTGTTTT TTTTTGTTTT GTTTTGTTTT TGTTTTTGTG ACGGAGTCTC 
29001 ACTCTTTCAC CCAGGCTGGA GTGCAATGGC GAGATCTTGA CTCACTGCAA
29051 CCTCCGCCCC CTAGGTTCAA GCAATTCTTC TGTCTCACCC TCCTGAGTAG
29101 CTGGGATTAC AGGCGCCTGC CACCGTACCC GGCTAGTTTT TGTATTTTTA
29151 GTAGACACAG GGTTTCACCA CCTTGGCCAG GCTGGTCTTG AACTCCTGAC
29201 CTCGTGATCC ACCCTCCTCG GCCTCCCAAA GTGCTGGGAT TACAGGCGTG
29251 AGCCACGGCG CCCAGCCTGT ATCATAGTTC TTATGCACAA AGACCCTTTA
29301 ATATTGTTTG TAAATTCTCC CCTATGCACA CGCTGACCTG TTCCTTAATC
29351 TTCTTATCTG TCTAGGTTTG GAGCAGGTAT GTTAAGAAGT TAGGGGATTT 
29401 TGCTAAGCCG GAGAATATTG ACTTGGCCGT GCAGTGCCTG AATGAACTTA
29451 TAACCAATGC ACTGCACCAC ATCCCAGATG TCATCACCTA CCTTTCGAGA
29501 CTCAGAAACC AGAGTGTGTT TAACTTCTGT GCTATTCCAC AGGTAGGGAA 
29551 CGGGGCTCCT CTGGGTGGAT ACGGGGCTAA AGGGAGTGGG GTAGGAGTAA
29601 GGGTGGATTT TGCTGTGCTA TATTCAAGGA TATGATTCCT TAAAAAGACG
29651 ATGACTCCAG TTTATTACGC TGGGAGTTTC ATAGCACCCG CCTTTGCTTC
29701 CAGCCACCAA ACTCAGCTCA GCCTTGAGGT TAAGCCTGCT CCTTTTCAGA
29751 ACCTTCTTTC CGGATTTACT ATTTTCTACA GCTATCCTAA ACTAGTTAGG
29801 TTCTTTTCCT CACAGTTAAG TCAAGGTCTT TGGCTTAGAT TTATGGGGAG
29851 TGCTGGGTAA AACCTGGGTG AAGCTGTTAT CATTAAAAAG TCTTCATTAA
29901 GCACCTAATT ACTGCTGTCC TTTTCCTAGA CCCGGCATAA AAAGAACCTG
29951 GTCCGGTAGA CCTAGCCTCT CAGTATGCTA GGAACTTACA CTTTTTAGTT
30001 GCCTTTACCA AGTATTGCAG ATACTACTGC AAATAAGTGA AGAAAGTAAC 
30051 AGCATTTAAC TGATTTGGGA ACTTGGTTTG ATCTTGTTCT AATGACCCAC 
30101 TTCGAATGGT GGTTGAAAGT AAAATCTGTA TCGCCGTCTT ATGTTTCCAT 
30151 TTACCTAGAA ATACTTTACC TTTGAGCACA GGAAATTAAT CCCCTTCTGG 
30201 TTGTTCTCCC CCTGGCATTG GTTTTAAATA TATAATGATT ATGTTTGTTG 
30251 TAGGAAAAAT AGAAAAACAA CTACAATAGA AAATTCTTCC CATATATTAT 
30301 TTTGAAATAC ATATTTCCGA TCCGATAATC CATTGCTCTA GCATGGAAAA 
30351 TGTTGGATTT ACTTGTGTTT GCTTTTTCCA AATAAAATGG AACTTTTGTG 
30401 GCTACATTAT AGAATTGTTT TAGACTGCTT AATTCTGTGT GTTGTTGAGA 
30451 AAGGGAGGAG TGGGGAAGGT AAAAATCTTG ACATACTTTC TTCGTGGGTA 
30501 TTTTTTCTTG AGCGATTCCA TCTTAGTTGA TTAGCAGTTA GCAATTGCCC 
30551 ATTCAACAGA AGGTTTTCTT ACCTTTTTGT GATAATGATA GCTAACGACA 
30601 TCATTTCTTC TTTTTTCCCT CTCTTCTTGT TGTCTCTAGG TGATGGCCAT 
30651 TGCCACTTTG GCTGCCTGTT ATAATAACCA GCAGGTGTTC AAAGGGGCAG 
30701 TGAAGATTCG GAAAGGGCAA GCAGTGACCC TGATGATGGA TGCCACCAAT 
30751 ATGCCAGCTG TCAAAGCCAT CATATATCAG TATATGGAAG AGGTGGGTTT 
30801 TTATTTAACT ACTTGGATAA TTTGTAGCTA CTTTTATGAT TTAGTAATGT 
30851 CACTGTTTAA CCAGGTTTGG ATATTAGATG ATCCTAACAA TTCACTATCC 
30901 TGTGGCCTAA AGAGACAGGA ATTGATATCC TTTATAAGGA AAAAAGTCTA 
30951 TTCACAGGAG CCGAGCAGAT TGCTCACTGC TGTGTAGTAC CCTGGTGAGA 
31001 GGAGATAAAT GGAGCAAGGC TGTAGGTTGG AGCCCCTCAG TAGAATCATA
31051 GATTTTGAGC TGCAAGATGA TGCAGGAGGC CAACCAAGCT TCTTGTTGCT 
31101 GGTGAGGAAT GTGAGGTTGA AGCTTGTCTG TGCTGATGCA GTGCGTGATT 
31151 GAGTGGATCT CTGGCTCCCG TCCATGTGTC CTGACACCCA GTCTGGTACT 
31201 TTCATTATGC CACAGGCCTC AATTGAAAAA TCACAGTAGG GAATTTAGGC 
31251 CAAGGAAAGC CATCAAGTTG CAATTATTTC CTAAATTTTC TTTGGAAAAT 
31301 TTCATTTCAA ATACCAAAAC CATCCTATAA AAAGAAAACT TACCTTCTTA 
31351 GGTCAAATCT CTAATATTTG ACTAGGTTCA AAAAGTTTAT TTCTGGCCAG 
31401 GCACAGTAGC TTACTCCTGA AATCCCAGCA CTTTGGGAGA CCAAGGTGGG
31451 AGGATCACTT GAGGCCAGGA ATTCAAGACC AGCCCGGGCG ACATAGCAAG
31501 ACCCCATTTC TACAAAAAAT TTAAAAATTG TCATGGTGGT GCACGCCTGT 
31551 GGTCCCAGCT ACTCAGGAGG CTGAGGCAGG TGGATCACAT GAGCCTGAGA 
31601 GGTCGAGGCT ACAGTAAGCT GTGTGATTTC ATCATTGCAC TCTAGCCTGG 
31651 GTGATAGAGT GAGACTTTGT CTCAAAAAAA AAAAAAAAAA AAAAAGTCTT 
31701 AGAGACCAGA AGTCTCTGTA ATCTCTAATA ATCTCTAGGC CCTAGAGCAG 
31751 TGGTTTGTAA ATGGAGGTGA TTTGCTCCCC TCCCCCCAGA GGACATTGGA 
31801 CAATGTCTGG AGACATTTTT GATTGTCCTA ACCGGCAGGA ATCGGGTGCT 
31851 ACTGGCATCT GGTGAGTAGA GGCCCAGGAT GATGCTGTGA TCCTCAGGTG 
31901 TGATCCTGTT GAGAATGAAA CACTGTAGAC TTTATGAAAA CATACAAGAC 
31951 CCTCATCATT TTTCCTTTGC CTGAGCTCCC TCCCCAGAGG TTACCTCTGT 
32001 TCATGGTTTT GTGCATCCGT CTAGTCCCCC TGTTACGCGT TTACAGGAAT 
32051 ATGGTTTGCA ACAGTGTTTT CATCTAAATA GAATTATACA AAATAGCGAT 
32101 TTCTGATTTC TCTTGCATAT TGCACATTCT TCTTATACTT CCTCCCTACC 
32151 TTTATCTGAC ACAGAAATGC TGTATGTCCA GAACTTCTAT CAGAGGCACC 
32201 TATGGAAGTC TAAGGGAAGA CCACATCGCT TTTAAAAACC CTAAAATTTT 
32251 GTAGTCACTA GATGAAAATA TTCAGCCAGT GACCCAAAAA ATTGCTACCA
32301 ATGAGACTCT CCATTTTGCC ATGTAGCCAG AACTTACTTT GATCTATGTG 
32351 CCTGGGGTAG TGACCAAGTA GGTGGGTAGG AGTAATCTCA GGGAAACTTG 
32401 AGGCCCCAGC CTCATGGCTA GGGTCATAAT TTGAACCCAG GTCTGTCTGA 
32451 CATCAGAATC CATGATGTTA ACCCCAATTC TAAGGGGTTC AACTACCCTT
32501 TCTAAATGGA ATCCTGCTAT ATTAAGCACT ATTTATTCAT TTTATATAAA 
32551 CTAGAAACAT TTTATGTAGT AAGTAGTTGA GAGTGTTTTG GTTTTGCAGT 
32601 TTGATCACTA GTTTTAGAAA CCAGTTTTTA AACACTTTGT GGCCAATTCC 
32651 ATTACTATAT TAAAATTCAG ATTTATTTGG TTTTTCCTTA ACTATTGGGA 
32701 TTAAATCCTG GTTGTAATTC ATAGTTTGAG GGCGAGGGTG GGCAGTCTAC 
32751 ATTTGGCTGA GCCCTGTTTT TGTGAATAAA TGTTATCAGA ACACAGCCAC 
32801 ACCCATTTGC TTCTATGTCT TCTGTGGCTG CTTTTGCAAT GTGACGGCCG 
32851 AGTTGAGGAG CTGCAACAGG CGATGACTTG TAAAGCTGAA AATATTTTTT 
32901 GGCCCTTGAA TAAGAGGTTG GCTGACTTCT GACTTAGGGC ATCAGTTGTT 
32951 CTGTTATCCC AGTAAAACTC AAGGCATTAG GGGAGAAATG TTAATATTAA 
33001 TACTTAAGTT GATTTGATTT AGGGAAATCT TTGAAGATTT CTAAGTCTTA 
33051 AGCAGTAGAA CCTGTTAATG GTTTTAGTTT CAGCAGTAAG GACATTTTAC 
33101 AAGTAAAGTT TTAAATGAAA ACATTTTGTA TGAAGCCACA AGTCGTCTGG 
33151 CCTCTTGCTG GTGTCCAGAT ATTAACACTG ATCCTATTTC TCCTTGCTGA 
33201 CCAAGTCTGT CCTTTGTAGT AAGAAAGGAA GAAACGTTGA CTCTGTCCGA 
33251 TCTCTGGACT TAGTGTTGTA GCGAGCATGC ACCTGGAAGG GACTTGCCAG 
33301 AGGACCTCCT CATGCTTCTC CAGTGCTTAG TGGGGGCTTG GAGTGCAGCC 
33351 CCAGGTCTTC ACGAGCAGTT GGCCACACTG CAGGGCCCTC ACCCCACTCT 
33401 GGAGCAGCCT CTGCTTCAAA CCAGCCTGGA TGCTTGTCAG CTGGGGAGAA 
33451 GATCAACCTG CTATTTTGGG ATAGAAATAA ATGCTCAGCC AAACGGCCAG 
33501 AAACCCCCAT TCCCCTCTCT GCCAAAGTGA ATTCCTTGGC AGGGAGAAGC 
33551 TTGTTCGTGT CTCTGCACAC TTCCTGTGCC CTCCTGTGGT TAAGTCAGAG 
33601 AATCATCCGG CTCTTTGAGC CCCAGGTGCC TAGCTGCTCA AGGATGGTCC 
33651 CCAGCCAGCA GCTGCCAGGA ATCACCTGGG AGCCCATTAA GACATCCAGC 
33701 CCCCACCCAA ACCTATCGAA TCAGAATCTG CCTTTTTTTC CCAAATGATG 
33751 TTTTTGCTTT AATGGAAGTT TAGATGTTCA TAGACAAGAG TTTTAAATGA 
33801 TGATCAAGCT GATTCCATAT TCGCAGTTGT AAGTAGAACT GCTGAGACGT 
33851 GGAAGTACCA CATGGACTCA CAGAGGAGCT GCTGTATGTA GCACAGCATT
33901 GCACAAGAGC TTATTTCAGT CTAGTAAACA TTTATAGGAG CCTGTGTCAT 
33951 TTAATCATCA AGCCTCGCAC TGTGGCTCAC ACCTGTAATC CCAAAACTTT 
34001 GGGAGGCTGA GGCAGGCAGA TCACTTGAGG TAAGGAGTTC GAGACCAGCC 
34051 TGGCCAATAT GGCAAAACCC TGTCTCTACT AAAAATACAA CATTTAGCCA 
34101 GGTGTGGTGG TGCACACTTG TCATCCCAGC TATTCCGGAG CCTGAGACAT
34151 GAGCATCGCT TGAACTCGGG AGGTGGAGGT TGTAGTGAGC TGAGATGGCA 
34201 CCACTGCACT CCAGCCTGGG CAACAGGGTG AAGGCCCTTT CTCAAACTCC 
34251 TCAAGTATTT GGCTTCAACT TTATGCCGGG CATGTAGATG AAAAGTCGGC 
34301 TATGACCTGT CCTTGACAAG CAGATGTAAC TCCTTGATTG AGGCTAGTAG 
34351 GTTTTTAAGA CCTGAATAAT TGAGTTTGCA GAAACCTACT GTGTGCCTTC 
34401 AGGTAAATGG AGAGTGGGGT TTGGTCTAGC AACGAAGCAT CTAGAAGGTC
34451 TCTTTGGCCT TACCGGCTCT GTTTTAGGTA AGTCCACGTC TGAGTACCAG
34501 TGACTGCAGC TCTTCCAGTT GTGCTGTCAT GTTTATATGT TAGAAATGAT
34551 CATCAAAGGA CTCAAAAGTT TTGCCACTAA TTGTATTACC GGGGACTGTC
34601 ACAACCAAGA TTTCTCTTAA TTTATTCACC TTACTTATCT CCTGGAAGGG
34651 CATATTGAAG TGCTCTTGGA GTTCTCTAAA AGGGTTTTTG TTGGTTGTGT
34701 ATATTCACTT GGGTGCCAGC GATTGATTCC AAATAAGTAA ATCTTTTTTC
34751 CCAAAAGGAT GTAAGATGGC TTATGGTTAT AAGTACAACA GGCTAACAAA 
34801 GTACAAGTAG ATGAGAAAGT AAAATGAAGA AATAAAGTCA TAGGAGCCAC 
34851 AGAATTAACC CAGGAATGAA TAAGTGTGTA GTTTGGTGCT GATGTTATCA 
34901 TCCTTTATTT GTACATTGCT TGTACAGTTG CTCTGAGAAG GTAAGTCTTA
34951 AATTTTCAAA AGTGAAATGT CACCGAGCAT GGTGGCTGAT GCCTCTAATC
35001 TCAGCACTTT GGGAGGCTGA GGCAGGCGGA TCACTTGAGG TCAGGAGTTC 
35051 GAAACCAGCC TGACTTATGT GATGAAACCC TGTCTCTACT AAAAAAAAAA 
35101 AAAAAAAAAA AAAAAAAAAA AAAAATCCAA AAGTTAGTTG GGCATGGTGG 
35151 CAGGTGCCTG TAATCCCAGC TACTTGGGAG GCTGAGGCAG GAGAATCGCA 
35201 TGAACCTGGG AAGTGGAGGC TGCAGTGAGC CAAGATTGCA CCACTGCACT 
35251 CTAGCCTGGG TGACAGAGCG AGACACCATC TTAAAAAAAA AAAAAAATCT 
35301 ACAATATACC AAAACCATTA CTTACCTGAG AAACTATTCT CAGGGTCATT
35351 GTAGTGAATG CCTATTTTAT GGCTTTTGAT GGCATCAGGG CACTCAGGTC 
35401 ATTTACAAGA GTAGTGTGTG AGACCCTGTG TGTCACTGCC ACTCATCTTG
35451 GCCTTCGGCC ACTGCTGTAG CAACCAGTTT CCAAGTAGGG CTGGACCTTG
35501 CCTTCTGCTC CAGAGACCTC TCGCTTCCTG CCCTTGGGCT TCTGACGAGC 
35551 TGCAGGAACT GCCTGGCACG TGGGTCCCCA CAACCCAGAG GAGGTGAGGG 
35601 CCACCTCTCT GCTCCTCAGG GCCACCTTTC ATAAGGCTCC TTGAAGGTCC 
35651 CTCAAGATCA AGCCAACTCA ACACATCCTT GATAGGCCTT CCTGCCTTCT 
35701 GTTTCACTTC TCCACTCGTT TCCAAATAAA TGGCTGCATG CAAGCTTTTG
35751 CCTCAGGTTC TGCTTTTAGG AGGAAGGCTA AGACAAGCAG TAAAGCAACA 
35801 TGGGCAGGCA GAAGGATGAC TTCTAATAGA ATTATCTCAT CACTATATAT 
35851 TTTACTTTAT GGATGCTTGT ATTGAAAAGT CTTGGCTGGG TGGAGTGGCT 
35901 CACGCCTGTA ATCCCAGCCC TTTGGGAGGC CGAGGTGGGT GGATCACTTG 
35951 AGGTCTGGAG TTTGAGACCA GCCTGACCAA CACTGGTAAA ACCTTGTCTC 
36001 TATTAAAAAT GCAAAAATTA GCCAGGGATG CACGCTTGCT GTGTGCCAGC
36051 ACAGGGCTAG GCTGGAGATA AAAAGGTGAG TAAGTAGGTG CGGTGTAGTC
36101 AGGGTGAAAA CTACAGATGG TCCATTTCCA CGTAAGTGGA AAGGTAAAGG
36151 TATGTACAAT AGGGTGGCTC CTGGCTGAAC CTGGAGCTGC AGACAGGTTT 
36201 TCTAGAAGGC ATAATCCTGA AGTTGAGACT TGGGGGCCTA GGTAGGAGCC 
36251 AGTTGAAGGG ACGTGGGAGG CGCATTCCAG AGAGAAGGAG TGGTATGAGA 
36301 CTGGAACAGA GGTGTGCAGC AGCATCGCAT GGGCGAAACA ACAGTAGACA 
36351 GTTGTTCTTT TGTTTTTGTT TGTTTTTTGA GACAGGGTCT TGTTCTGTCA
36401 TCCAGGCTGG AGTGCAGTGG CATGATCTCG GATCACTGCA ACCTCCACCT
36451 CCCAGGCTCA AGTGATCTTC CCACCCCAGT CCCCAAGTAG CTGGGGGACC
36501 ACAGGTGCAT GCCACGATGC CCGGCTAATT TTTGTACATT TTGTAGAAAC 
36551 AGGGTTTTAC TGTGTTGTCC AGGCTGGTCT TAAACGCCTG AGCTTAAGCA 
36601 GTCTACATGC CTCAGCCTCC TGAAGTGCTG GGATTCCAAA CATGAGCCAC 
36651 TGTGCCTGGC CCGGCAACTG TTACTAGACT ATAGAGAGGG AGGTGGGCAA 
36701 GGGCTGGTGA CACTAGACAG GTGCAGTAGG TCTGGACCAT GGGTGGCCTT 
36751 GCGCTACACA TTACAGAGCT CAGGCTTTTT TTCTCCAGGT GAGAGGGCTG
36801 GTGCCACTGA GGCATCAAGC AGAGGTTTGA GATCTCCTTG GTGACAGTGT
36851 AGAGCAGACA GGTAGATTTG GGAATTTAAG CTTAGACTCA CGTTGGAGAC 
36901 TGAGATAGCT CATCTGAGAG GCACTCAGGG CCTAATCTCA GGCAGTAATT
36951 TTAGGGATGT AGGGGAAGAG ATGGATTCTG CACATACTTG GGAGGCTTGT 
37001 GGAGGAGTGG GGAGGGAGGC ACAGGGAGGA CTCCAGGGTG GTTCATACGG 
37051 CTCCCTGCTT CTGTTCCTGT CCCCCTTTGT CAAGCTGTGG TCTGTACTGC 
37101 GTGTTCCATC TTGTTTCTAA GCTGCTTTTG CCCAGTCTTT CCAGCATTTC 
37151 CCTTTCGTCA TGTTAGTCTG TGCCTGTCTA CGTGAACTAT GGTGACGTTT 
37201 ATTGGGCCTG GCACTGTGAG GTGCTGGGGA TGTGAAGATC ATTGTGGCTC
37251 AGCCGCTGCT CTCGAGGGCC TCTGGGTGCA GTATGCACAC CTGTGCCTCC 
37301 TGTTTGCTCA GGAAGACAGG CTTTGAGATG AGCTGGGGCT GACATCCCCA 
37351 CCTTATCATT GGGATGGCTT TGGGTAAGTT ATGTTCATGT TCTCTGAGCC 
37401 TCCCTTTCCT CATTGGTAAA ATGGGTATAA AATACCTGCC AGTGGAGGGT
37451 TGTTGTAAGT AGCCATGGAA AATGTAAAGC ACATAGCACT TACCATTTTT 
37501 TCCTGTGTCT TTAACAGATT TATCATAGAA TCCCCGACTC AGACCCATCT
37551 TCTAGCAAAA CAAGGCAGAT CATCTCCACC ATCCGGACGC AGAATCTTCC 
37601 CAACTGTCAG CTGATTTCCC GAAGCCACTA CTCCCCCATC TACCTGTCGT
37651 TTGTCATGCT TTTGGCTGCC CTGAGCTGGC AGTACCTGAC CACTCTCTCC
37701 CAGGTAACAG AAGACTATGT TCAGACTGGA GAACACTGAT CCCAAATTTG 
37751 TCCATAGCTG AAGTCCACCA TAAAGTGGAT TTACTTTTTT TCTTTAAGGA 
37801 TGGATGTTGT GTTCTCTTTA TTTTTTTCCT ACTACTTTAA TCCCTAAAAG 
37851 AACGCTGTGT GGCTGGGACC TTTAGGAAAG TGAAATGCAG GTGAGAAGAA 
37901 CCTAAACATG AAAGGAAAGG GTGCCTCATC CCAGCAACCT GTCCTTGTGG
37951 GTGATGATCA CTGTGCTGCT TGTGGCTCAT GGCAGAGCAT TCAGTGCCAC
38001 GGTTTAGGTG AAGTCGCTGC ATATGTGACT GTCATGAGAT CCTACTTAGT 
38051 ATGATCCTGG CTAGAATGAT AATTAAAAGT ATTTAATTTG AAGCACCATT 
38101 TGAATGTTCG TACTAGTAGA AAATGATGTG AATTTTCTTT CTGTTCGGCT 
38151 CCTATTTTTC TCATCATTTT GTTTTCTTTA ATTGGGTTGA ATGGAGTAGA 
38201 TAGAAATATT TATGGTTTAG GTAACAGTTA GATGTTTCCT AAGAATGCAA 
38251 ACTGCCTTTT CCACACAAAG GCTGGGAATA AAATTCTGGG TATTCTCGTA 
38301 TTCTCATTTA AAGGAGTTTA GCTTTCAGAG AGAAACAGCA GGATTGCTTT 
38351 TGACCTTTTA GAAGATTGGT CTCCAGTAAA GGTGGACATT TTTGAGATTT 
38401 TTATAATAAA GAATTTAATT GCTCTGCATT TGTCAAGTAC AGTTCGCTTG 
38451 AAAGCCTGCC TGACTGTGGA AAAGATGGAG CTCAAGAATG GAGTTGATGG
38501 CCCAGCGTGG TGGCTCATGC CTGTAATCCC AGCACTTTGG GAGGCTGAGG 
38551 CGGTCGGATC ACGACATTAG GGGATCGAGA CCATCCTGGC TAACACGGTG
38601 AAACCCCCGT CTCTACTAAA AAAAAAAAAA ATTAGCCAGG CGTGGTGGCG
38651 GGTGCCTGTA GTTCCAGCTA CTCGGGAGGC TGAGGCAGGA GAATGGCTTA
38701 AACCCGGGAG GCGGAGCTTG CAGTGAGCTC AGATCGCGCC ACTGCACTAC
38751 CAGTCTGGGC AACAGAGCGA GACTCCATCT CAAAAAAAGG AAAAAATTGT
38801 AAAAAAAAAA AAAAAAAAAN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
38851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
38901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
38951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
39701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
39751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
39951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
40001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
40051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN (SEQ ID NO: 3)

Context:



DNA 

Position 

825 GCAGTGAACGTACCTGACAGGTTTCCTGTTTGTTTTTGAGATGAAGTCTCGCTCTTGTCC 
CCCAGGCTGGAGTGCAATAGCGCGATCTCAGCTCACTGCAACCTCTGCCTCCTGTGTTCA 
AGCGATTCTCCTGCCTCAGCCTCCCAGGTAGCTGGGATTATAGGCGCCTGCCACCATGCC 
TGGCTAATTTTTGTATTTTTAGTAGAGACGCAGTTTCAGCATGTTGGCCAGGCTGGTCTT 
GAACTCCAGACCTCAGGTGATCCGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGC
[G,A]

TGAGCCACCGCGCTCGGCTAGACCTGACAGGTTTTAAAAGGATTACTGGTTGCTGTGTTA 
AAACAGACTGCAGGATGGCTTAGGTAGCCAGTAGGTTTTTTTTTTTTTTGGAGACGTAGT 
CTTGCTCTGTTGGCCTGGCTGGAGTGCAGCGGTGTCATCTTGGCTCACTGCAAACTCCGC 
TTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCGGAGTAGTTGGGACTACAGGCGCC 

CACCACCACACTCGGCTTTTTTGTATTTTTAGTAGAGACGGGTTTCACCATGTTGGCCAG (SEQ ID NO: 7)



2632 GCCGTCCTGGCTGACCTGTCCCTGCCCCCGCAAGCCGCCCTGGGCATGAGCGACTTTTGC 
GTGGTTCCCGGTGGTTGCGCTCCCCGTTTCGTCCCCTCCGTGAGCATCGGCGCTTACCGG 
TATTTTAACCCGAGGGTTACACATCTGAGGCAATGTGGGTGGGTTACGCGGGAGAGGACG 
AGTGAGTTTTTTGGTAAGCGGAATGAACTATGCAGATAACATCACATGAAGGCCGTTTCT 
GGAATGAAGTCTGACTCCTCCAGTTTCACCACCTCTTCCGGAGCTCTCCCCGCCTTGCTG 
[C,T] 

CTTCCATCGCTTCATCCTCGGTGCTTCCTGAGTTTTAAAATCGCCTATCTACGCTTCCAA 
GTTCCAATGAGTTATCTAACGTCTATGGATTAGCTAGGTGGTTGGTGGAAGGTCAGAACT 
TGGTTTTACTTAGATTTTTATCTGCCTCATGCCTGTACTATTTGTTTAATGAATGCATAG 
GAGGTGTTTTTATTCCAACAAGAAAATTATTCGTACGCGATTATTGAATGAATAGACAAA 

TTCAGCCAAGTTCTTCTGGTCTGGACCAGCCTGGCTGATTTCTGTAACTTTTTTGGGCCA (SEQ ID NO: 8) 



4430 GGCCTTTTTTTTTTTTTTTTTGAGGGGGGGGTCTCACTCCATCGTCCAGGCTAGAATGCT
GTGGCCTGAACATGACTCACTCCAGTTTTGACTTCCTTGGCTGAAGCCATCCTCCCACCT 
CGGCTTCCTGATCCCGAGTAGCTGGGACTCCAGGCACGTGTCACCAATGCATGGCTAATT 
TTTAAATTTTTTTGTAGACACAATGTCTCGCTGCATTGCCCAGGCTGGTCTTGAACTCCT 
GAGCTCAAGCGATTTTCCCACCTCAGCCTTCAAAGTGCTGGGATTACAGGTGTGAGCCAC 
[C,T] 

GCACCCAACCAGTTTCTCTCTGCAAACTAGGGAAAAAATTTACGCTTAGCAGATATTGAG 
GGCTGATTATTTCTATCACAGAAGCATTTGGCTATAGAATTTCAGGGTTTAGTAAACTTG 
ATTTACACTGAATTTTTAGGTGCATATCAGTAAATCTACGGGCATATGCCGCCTGCAAGT 
TGTGTGGCATCACCCAAAAGCCGAGAGTAATGGAAAGAGCAGGCTGTTAGTAATCAGGCA 

GATCTGGCTCCTGTCCAATCTAAATCCTGTTATTTAGACTAATATCTTAAGTCTGTTATT (SEQ ID NO: 9) 

4791 GGCTGATTATTTCTATCACAGAAGCATTTGGCTATAGAATTTCAGGGTTTAGTAAACTTG 
ATTTACACTGAATTTTTAGGTGCATATCAGTAAATCTACGGGCATATGCCGCCTGCAAGT 
TGTGTGGCATCACCCAAAAGCCGAGAGTAATGGAAAGAGCAGGCTGTTAGTAATCAGGCA 
GATCTGGCTCCTGTCCAATCTAAATCCTGTTATTTAGACTAATATCTTAAGTCTGTTATT 
AAGTCCGATTTCTGACGCTATTAAGTTAGGTGAACAACCTTGGTAACTTAACCTCTGAAC 
[C,T]

ACAGTTACTTCATCTGTAAAATAGGGATGTATGTATGGTAACGATTTTTTAACCACAACT 
TCCCAACTCTAAGATGGTCTGAAAAGAATTTTTTGAGTGTTTGGCTCAGAATCACTTGGC 
AGCAAAACCTGACTTGAAGTTGAGGCTTCATTCATCCCACTTAGTATATTCAAATGTTTT 
GCTAAAGAAATAATTATGAGGTGCTACTTCACACTGACTAGGGTTGTATATGCATTTTAT 

TGCCTATTTTCTAAAACACTAAAAATGCTAAATTCTGCCCCAGGTCTTGCCACAGATGTT (SEQ ID NO: 10) 

4886 CTACGGGCATATGCCGCCTGCAAGTTGTGTGGCATCACCCAAAAGCCGAGAGTAATGGAA
AGAGCAGGCTGTTAGTAATCAGGCAGATCTGGCTCCTGTCCAATCTAAATCCTGTTATTT 
AGACTAATATCTTAAGTCTGTTATTAAGTCCGATTTCTGACGCTATTAAGTTAGGTGAAC 
AACCTTGGTAACTTAACCTCTGAACCACAGTTACTTCATCTGTAAAATAGGGATGTATGT 
ATGGTAACGATTTTTTAACCACAACTTCCCAACTCTAAGATGGTCTGAAAAGAATTTTTT 
[G,C] 

AGTGTTTGGCTCAGAATCACTTGGCAGCAAAACCTGACTTGAAGTTGAGGCTTCATTCAT 
CCCACTTAGTATATTCAAATGTTTTGCTAAAGAAATAATTATGAGGTGCTACTTCACACT 
GACTAGGGTTGTATATGCATTTTATTGCCTATTTTCTAAAACACTAAAAATGCTAAATTC 
TGCCCCAGGTCTTGCCACAGATGTTTCAGTGGACTATGGGCCTGTGAGACCTTAAAGGGT 

TGATTGAGTAAGGATCACAGGTGATGTCCGCATTGTGCTTGGCATGGAGTTAAGTGCTTG (SEQ ID NO: 11)



4887 TACGGGCATATGCCGCCTGCAAGTTGTGTGGCATCACCCAAAAGCCGAGAGTAATGGAAA 
GAGCAGGCTGTTAGTAATCAGGCAGATCTGGCTCCTGTCCAATCTAAATCCTGTTATTTA 
GACTAATATCTTAAGTCTGTTATTAAGTCCGATTTCTGACGCTATTAAGTTAGGTGAACA 
ACCTTGGTAACTTAACCTCTGAACCACAGTTACTTCATCTGTAAAATAGGGATGTATGTA 
TGGTAACGATTTTTTAACCACAACTTCCCAACTCTAAGATGGTCTGAAAAGAATTTTTTG 
[A,T] 

GTGTTTGGCTCAGAATCACTTGGCAGCAAAACCTGACTTGAAGTTGAGGCTTCATTCATC 
CCACTTAGTATATTCAAATGTTTTGCTAAAGAAATAATTATGAGGTGCTACTTCACACTG 
ACTAGGGTTGTATATGCATTTTATTGCCTATTTTCTAAAACACTAAAAATGCTAAATTCT 
GCCCCAGGTCTTGCCACAGATGTTTCAGTGGACTATGGGCCTGTGAGACCTTAAAGGGTT 

GATTGAGTAAGGATCACAGGTGATGTCCGCATTGTGCTTGGCATGGAGTTAAGTGCTTGA (SEQ ID NO: 12) 

4889 CGGGCATATGCCGCCTGCAAGTTGTGTGGCATCACCCAAAAGCCGAGAGTAATGGAAAGA 
GCAGGCTGTTAGTAATCAGGCAGATCTGGCTCCTGTCCAATCTAAATCCTGTTATTTAGA 
CTAATATCTTAAGTCTGTTATTAAGTCCGATTTCTGACGCTATTAAGTTAGGTGAACAAC 
CTTGGTAACTTAACCTCTGAACCACAGTTACTTCATCTGTAAAATAGGGATGTATGTATG 
GTAACGATTTTTTAACCACAACTTCCCAACTCTAAGATGGTCTGAAAAGAATTTTTTGAG 
[T,A]

GTTTGGCTCAGAATCACTTGGCAGCAAAACCTGACTTGAAGTTGAGGCTTCATTCATCCC 
ACTTAGTATATTCAAATGTTTTGCTAAAGAAATAATTATGAGGTGCTACTTCACACTGAC 
TAGGGTTGTATATGCATTTTATTGCCTATTTTCTAAAACACTAAAAATGCTAAATTCTGC 
CCCAGGTCTTGCCACAGATGTTTCAGTGGACTATGGGCCTGTGAGACCTTAAAGGGTTGA 

TTGAGTAAGGATCACAGGTGATGTCCGCATTGTGCTTGGCATGGAGTTAAGTGCTTGATA (SEQ ID NO: 13) 

5110 AAATAGGGATGTATGTATGGTAACGATTTTTTAACCACAACTTCCCAACTCTAAGATGGT 
CTGAAAAGAATTTTTTGAGTGTTTGGCTCAGAATCACTTGGCAGCAAAACCTGACTTGAA 
GTTGAGGCTTCATTCATCCCACTTAGTATATTCAAATGTTTTGCTAAAGAAATAATTATG 
AGGTGCTACTTCACACTGACTAGGGTTGTATATGCATTTTATTGCCTATTTTCTAAAACA 
CTAAAAATGCTAAATTCTGCCCCAGGTCTTGCCACAGATGTTTCAGTGGACTATGGGCCT 
[G,T]

TGAGACCTTAAAGGGTTGATTGAGTAAGGATCACAGGTGATGTCCGCATTGTGCTTGGCA 
TGGAGTTAAGTGCTTGATAAATGGTGGTTATCAATCTGATTATGTAAATTTATGTAAATT 
CAGTTCTCAAGTTTGTGGTTTTTTTCCCCTCCTGGAGAAATCTATTCTATTTTAAAGTGA 
GGAAGGCTCCGTGGAGGGCTGGTAGCTGGTAGCTGTTCACTTGTGGAACTTTCAGCCTGA 

GGCTGGAGCCCCTTCCTGGGAGTCTGGTCTTGTCGTCTTCCTGACCACCCCCACACCCTT (SEQ ID NO: 14) 



6911 CCACCTTGGCCTTCCGAAGTGCAGGGATTATAGGCGTGCGCCACTGCACCCGGCCCTGTT 
GGATAAATGATTCCAGTCTCTCCCAAAAAGAACTGTTGTAAGACTGTGGGGTGAGGGGAG 
GGAAGGGACAAATAGGAACCCGCCGTATTTTCCACTCCCTGTGGGCCTAAAACTGCTCTA 
AAAAATAGTCCATGAAAAAATACATAGTACAAACAGCAACTCTTTCTGATATGCTTGCAT 
TTAAAATCAGGCTTTTTCTCCCTTTTGGAAAAACACAGTCCTTGTTTGCTTTAGGGAAGA 
[G,A]

TAAAGGTCAGTGCGCTGCATTGCATTAATTTCGAAGGGAAAGATGAGAAGACATCTTGAA



FIGURE 3P 






AGGAATGGCTGGCTTTCTAGAGAATAGTAGAGGCTTAATAGGTGTCATAGAAAAACCAGG 
GTTGGACAGTGGTAGTAAAACGGCAAAACAGATTTTATTCAGAAAAACTACTGCAGTAAG 
AGGAGAGAGACCTCGGTACAGAACTGCTCCACTGCGAATACAAAGAAAAGTAGGAATTGA 

TGGCGGGGGAGCCGGATGTCAGTGGATGGAAAATTATTACGAGGAAACACAGGGGTGTGC (SEQ ID NO: 15) 

7212 TAAAGGTCAGTGCGCTGCATTGCATTAATTTCGAAGGGAAAGATGAGAAGACATCTTGAA 
AGGAATGGCTGGCTTTCTAGAGAATAGTAGAGGCTTAATAGGTGTCATAGAAAAACCAGG 
GTTGGACAGTGGTAGTAAAACGGCAAAACAGATTTTATTCAGAAAAACTACTGCAGTAAG 
AGGAGAGAGACCTCGGTACAGAACTGCTCCACTGCGAATACAAAGAAAAGTAGGAATTGA 
TGGCGGGGGAGCCGGATGTCAGTGGATGGAAAATTATTACGAGGAAACACAGGGGTGTGC 
[A,G] 

TTCTTGCTGAAGGCAGGCCAGAGTTATCAGACATCACCTGAGGGATGGAGGGGGATGTGG 
AACCTAATCGGCTGTCTAGGGTGATCAGATACTGAAGTTGGGGGATTCTGGTCAAATCAA 
TTTAGCAGGATTCTTGGTAAAACTGGGCGATGCAAAGACAGATGCGTTGAGTACAAAGTC 
CAGGCTTTATTGGGAAGAGGATTTCAGCGGAGCCCGAGTAGAGTTTGGTCTAGGGAGACT 

CTGTCACTGGGAGGACGAGCGAGCCGCTCGGAAGTGCGCTGGGTTCTCTTAGCGGCCAGT (SEQ ID NO: 16) 

7355 CAAAACAGATTTTATTCAGAAAAACTACTGCAGTAAGAGGAGAGAGACCTCGGTACAGAA 
CTGCTCCACTGCGAATACAAAGAAAAGTAGGAATTGATGGCGGGGGAGCCGGATGTCAGT 
GGATGGAAAATTATTACGAGGAAACACAGGGGTGTGCATTCTTGCTGAAGGCAGGCCAGA 
GTTATCAGACATCACCTGAGGGATGGAGGGGGATGTGGAACCTAATCGGCTGTCTAGGGT 
GATCAGATACTGAAGTTGGGGGATTCTGGTCAAATCAATTTAGCAGGATTCTTGGTAAAA 
[C,T] 

TGGGCGATGCAAAGACAGATGCGTTGAGTACAAAGTCCAGGCTTTATTGGGAAGAGGATT 
TCAGCGGAGCCCGAGTAGAGTTTGGTCTAGGGAGACTCTGTCACTGGGAGGACGAGCGAG 
CCGCTCGGAAGTGCGCTGGGTTCTCTTAGCGGCCAGTGGGTTCTGGTGAGAAGGGCAACA
GCGGGAGGAGGCGCCGGTGCGGAGCGGGAGGCCGGGGGCGGGGCTGCGGGGCTGCGGGGC

GGGCCCGTTGTGGGTCGGCCCAGCGCGTATTCGAGTAGAGGGCGAGCCCGTCCCGCCTCT (SEQ ID NO: 17) 



7398 GAGACCTCGGTACAGAACTGCTCCACTGCGAATACAAAGAAAAGTAGGAATTGATGGCGG
GGGAGCCGGATGTCAGTGGATGGAAAATTATTACGAGGAAACACAGGGGTGTGCATTCTT 
GCTGAAGGCAGGCCAGAGTTATCAGACATCACCTGAGGGATGGAGGGGGATGTGGAACCT 
AATCGGCTGTCTAGGGTGATCAGATACTGAAGTTGGGGGATTCTGGTCAAATCAATTTAG 
CAGGATTCTTGGTAAAACTGGGCGATGCAAAGACAGATGCGTTGAGTACAAAGTCCAGGC 
[T,C] 

TTATTGGGAAGAGGATTTCAGCGGAGCCCGAGTAGAGTTTGGTCTAGGGAGACTCTGTCA 
CTGGGAGGACGAGCGAGCCGCTCGGAAGTGCGCTGGGTTCTCTTAGCGGCCAGTGGGTTC 
TGGTGAGAAGGGCAACAGCGGGAGGAGGCGCCGGTGCGGAGCGGGAGGCCGGGGGCGGGG 
CTGCGGGGCTGCGGGGCGGGCCCGTTGTGGGTCGGCCCAGCGCGTATTCGAGTAGAGGGC 

GAGCCCGTCCCGCCTCTCGTCGGGCGCTTCCCAGATCTGCTTGAGTCTATGGAGGAAAAA (SEQ ID NO: 18) 



7653 AACTGGGCGATGCAAAGACAGATGCGTTGAGTACAAAGTCCAGGCTTTATTGGGAAGAGG
ATTTCAGCGGAGCCCGAGTAGAGTTTGGTCTAGGGAGACTCTGTCACTGGGAGGACGAGC 
GAGCCGCTCGGAAGTGCGCTGGGTTCTCTTAGCGGCCAGTGGGTTCTGGTGAGAAGGGCA 
ACAGCGGGAGGAGGCGCCGGTGCGGAGCGGGAGGCCGGGGGCGGGGCTGCGGGGCTGCGG 
GGCGGGCCCGTTGTGGGTCGGCCCAGCGCGTATTCGAGTAGAGGGCGAGCCCGTCCCGCC 
[T,C] 

CTCGTCGGGCGCTTCCCAGATCTGCTTGAGTCTATGGAGGAAAAACTCCGCGGGGTCCGC 
GATTCCCATGGCCGCAGCCGCCTGCGGCACCAAGGCCATGGCCCTCTTCAAGCGCACCTT 
GGTGCTGAGTCCCGCCGCGGCGCCCAGGGGCCCGGGCGCAGGCACCGCCCCGCGGGGCTG 
CTGCTTGCCTCCTGCCGCCTGGCCCTGCAAGGACTGGCCTCGGGGAGAGGGCGGCAGGCT 

GTGGAGCCGCCTGCCCCAGTCCCAGTCCCACTCCCACTCCCACTCCCACTCCCACTCCTG (SEQ ID NO: 19)



8310 CAGCCTGAAAACTTGCTACAAGTATCTCAATCAGACCAGTCGCAGTTTCGCAGCTGTTAT 
CCAGGCGCTGGATGGGGAAATGCGGTGAGTGATGGAGGCAGCGCCTCTGGCTTGGAGGAA 
AGCTTGTCCGGGACTTTTGAGTGTGTTGGAAGCTACCTTTTGATATAGCGCTCAGCGTTG 
CAGCCTCGTTGCTGTGGCTTATCCAGAACATAGCCCGGCCCTACGTGTTTACTTTAGAAA 
GCCCTTCCAGGCTCTTTGCCATCTAGTAGAGTCCCTGCGGGCCCAGCCTTTCAGAGAAG
[A, G] 

GGGGGGAGGGGGTGATGTTTATTAACTTTTTTTAGTCTTGGCAGCTGAACCTGCCTGTGA 
GCAGGTCGTGTATTTCTCGGCTTCCCTTATCCAACTTTGCATTTCTATTTCTAGCATATT 
GGGTTGATTCTTTTGAAGCTGCCTCTGTGCACATTACACCCATGAACTTAGACCAGTTGC 
CTTTATGTATGATCGTATTTATACTGAGAAGTTACTGTGTTTTTTGACTTTCTTTTCTAT

TTGCTACATATTAGTTCGGTCTAAACGTTTGGTCTTCTGGTCTCCATAGTTCTACATTG (SEQ ID NO: 20) 

8145 CAGCCTGAAAACTTGCTACAAGTATCTCAATCAGACCAGTCGCAGTTTCGCAGCTGTTAT
CCAGGCGCTGGATGGGGAAATGCGGTGAGTGATGGAGGCAGCGCCTCTGGCTTGGAGGAA 
AGCTTGTCCGGGAC 
[C,T] 

TTTGAGTGTGTTGGAAGCTACCTTTTGATATAGCGCTCAGCGTTGCAGCCTCGTTGCTGT 
GGCTTATCCAGAACATAGCCCGGCCCTACGTGTTTACTTTAGAAAGCCCTTCCAGGCTCT 
TTGCCATCTAGTAG (SEQ ID NO: 21) 

8031 CAGCCTGAAAACTTGCTACA 
[G, A] 

GTATCTCAATCAGACCAGTC (SEQ ID NO: 22) 

8462 GCTACCTTTTGATATAGCGCTCAGCGTTGCAGCCTCGTTGCTGTGGCTTATCCAGAACAT
AGCCCGGCCCTACGTGTTTACTTTAGAAAGCCCTTCCAGGCTCTTTGCCATCTAGTAGAG 
TCCCTGCGGGCCCAGCCTTTCAGAGAAGGGGGGGGAGGGGGTGATGTTTATTAACTTTTT
TTAGTCTTGGCAGCTGAACCTGCCTGTGAGCAGGTCGTGTATTTCTCGGCTTCCCTTATC 
CAACTTTGCATTTCTATTTCTAGCATATTGGGTTGATTCTTTTGAAGCTGCCTCTGTGCA 
[G,C] 

ATTACACCCATGAACTTAGACCAGTTGCCTTTATGTATGATCGTATTTATACTGAGAAGT 
TACTGTGTTTTTTGACTTTCTTTTCTATTTGCTACATATTAGTTCGGTCTAAACGTTTGG 
TCTTCTGGTCTCCATAGTTCTACATTGGTTAAATGCAACTCACTTCTGGGAGTAGTGGTG 
ACATTCAACTAGTAGGCTTTTTAATAAACTACAGAAGTTCATTACTCTCATGTAAGGAAG 

GAAAACTAATGTAACTTTCGTTAAGTATGAAAAGCGTTGGATATCCTTATAGTTCTTTAG (SEQ ID NO: 23)

8873 AAACGTTTGGTCTTCTGGTCTCCATAGTTCTACATTGGTTAAATGCAACTCACTTCTGGG 
AGTAGTGGTGACATTCAACTAGTAGGCTTTTTAATAAACTACAGAAGTTCATTACTCTCA 
TGTAAGGAAGGAAAACTAATGTAACTTTCGTTAAGTATGAAAAGCGTTGGATATCCTTAT 
AGTTCTTTAGAGTTAAGGGTGAGATGGGTTTAGAAAGTGGCCAGGCACAAGTTATTTTAA 
AATAAAAAATCTTTGGCTGTTTGTTCCAATATATTAATAGTTTTCCCTTTTTTACAGCAA 
[C,T] 

GCAGTGTGCATATTTTATCTGGTTCTCCGAGCTCTGGACACACTGGAAGATGACATGACC 
ATCAGTGTGGAAAAGAAGGTCCCGCTGTTACACAACTTTCACTCTTTCCTTTACCAACCA 
GACTGGCGGTTCATGGAGAGCAAGGAGAAGGATCGCCAGGTGCTGGAGGACTTCCCAACG 
GTGAGTGGGGTTACGCATCTTGTCTACGGACTGTTGTGTTCATAATTGCTAACGTGGTTG 

TCCGGTAGCCTCCATACATGTGGAGAAAGGTTAAATAAGCATTCTGAGGGCAGCATAATG (SEQ ID NO: 24) 

9190 ATCTGGTTCTCCGAGCTCTGGACACACTGGAAGATGACATGACCATCAGTGTGGAAAAGA 
AGGTCCCGCTGTTACACAACTTTCACTCTTTCCTTTACCAACCAGACTGGCGGTTCATGG 
AGAGCAAGGAGAAGGATCGCCAGGTGCTGGAGGACTTCCCAACGGTGAGTGGGGTTACGC
ATCTTGTCTACGGACTGTTGTGTTCATAATTGCTAACGTGGTTGTCCGGTAGCCTCCATA 
CATGTGGAGAAAGGTTAAATAAGCATTCTGAGGGCAGCATAATGTGAGGGTTAAAAACTC 
[C,T] 

GGTAGCCAAGACTCTGAAGCCAGGCTGCCTGGGTTGGAATCTCAAATCTCCCACTTACTA 
AACTGTTGGTTACTTACAAAGACTCTCTGTGCCTCAGTTTCTTCATCTGTAAAATAGGGG 
TAATAATAACACCTACCTCATGGTATTCTGAGGATTCAAAGAATTAACGTAGGTAATGCT 
CTTAGAATGTTAGCTACTGCTGTTATTATCAGTATTGGAAGTCCAGTGTTTCTTCCTGTG 

GGAAGACGCAGTCAAATTTTAGTGTTGTGAAAGATTCTCAGGCTAGCTCACAAAAGCCTG (SEQ ID NO: 25) 

9311 GAGCAAGGAGAAGGATCGCCAGGTGCTGGAGGACTTCCCAACGGTGAGTGGGGTTACGCA 
TCTTGTCTACGGACTGTTGTGTTCATAATTGCTAACGTGGTTGTCCGGTAGCCTCCATAC 
ATGTGGAGAAAGGTTAAATAAGCATTCTGAGGGCAGCATAATGTGAGGGTTAAAAACTCC 
GGTAGCCAAGACTCTGAAGCCAGGCTGCCTGGGTTGGAATCTCAAATCTCCCACTTACTA 
AACTGTTGGTTACTTACAAAGACTCTCTGTGCCTCAGTTTCTTCATCTGTAAAATAGGGG 
[T,-] 

AATAATAACACCTACCTCATGGTATTCTGAGGATTCAAAGAATTAACGTAGGTAATGCTC 
TTAGAATGTTAGCTACTGCTGTTATTATCAGTATTGGAAGTCCAGTGTTTCTTCCTGTGG 
GAAGACGCAGTCAAATTTTAGTGTTGTGAAAGATTCTCAGGCTAGCTCACAAAAGCCTGC 
CGACTGTATGATGCAGCCTACCTGTAACACTGCTGGCCTCTTGACTACCCGGAGCCTGGT 

AGCATGGGACTGCTGCTCACGATGGGCAGCAGCCTGGCATGGGGGCGGTGTCTGTTGGCA (SEQ ID NO: 26)

9847 CTGGTAGCATGGGACTGCTGCTCACGATGGGCAGCAGCCTGGCATGGGGGCGGTGTCTGT
TGGCAGCTAGGGCGAGCCTCTGCCACTTCACCTGTGATCCTGGGCAAGTTCCTTATCTGC 
TTTGTGTCTCCGTCTCCTCGTTTGTAAAGTTAGAGCTGAGAGGATTAATTTCGCACATAT 
AAAGTACTTAGTGCCTGGTACAGGGTAAGTATTCTGTAAGTATTAGCTATTTGGTCTATT 
TTGTTGGAGTAAAGTGGGTTATAGTTAAAATCCTAAGATTTTTAAAGTCCCTCAAGTTCA 
[T,C] 

GTGGACATCTGCCTAGGTCCTACTATCCTAGAATTCGCATGTCTTATCACACAAATAACT 
GATTCTTCCATATCTTATAAATAAAGGTTTGATTTAGCAAAGTCACATGTTGTGTAATAG 
CTCGAAGAAGCCCTTTTTGTCCACAGTTGCCAGAGCTTTTGGAGAACAGTCCTTATGTTA 
TTGAAACAAACCTAATCTGTAGCTGAGTTGGGAGGGAGCTAAGTGGACAGAGAGTCCTCC 

ACCCAAACAAAAGAATCTTTGATTCTTGGGCATAATGGGAGCAATATTTAAAAAAAAAAA (SEQ ID NO: 27)



10460 AGGAATGTTTGGGGAAGACTCTTGCGGTGCAAAGGCTGTTTCAGATTGCTGAGATCAGAC
CTTAAGTACCAAAGCCCAAATATAGTACAACATAATACAAATGAGAAGAAAATAGCTGAA 
GAATAATTCGAGTTTATACAGTACAATTCAAGAGAAGAAAGAAAATTTATGACGACTAGC 
TGGGTGAGAATTAGAACTGTAACCCTGGGAAGGTCCTGGTGATTTGACTCTCACAGGACA 
CCTGATGACCAGAGGATGGGTTTCCTTTGATGGGAAATCTGTGGCGATTCATTGATGGGC 
[C,T] 

TCTGAATTCTGCTGAAGCAGAGGAAGTAGTAATACCCCATTTATAATGGAAGTGCATTCT 
CACTTAAAAACAACTAATATTATTCTAGCTGGACCTAGCCTCTAGAAACAGCCAAATTAC 
ATTTGACTTGAGTGGATTCATAATAATTAAAAAATTTCTGGGGCATGGGATAAATGTGTT 
AGGTATTGCTAAGTCAAGGCAGCCCTATCCCCTCAGCAGAAGTGAGGGAATATGAAAGTG 

TGTGAATGCTAACATAATTTTGGGGAATATCGCCGTCAGATTTCCAGATGATATTCCAAC (SEQ ID NO: 28) 

20204 TCTGCCAGTTGCGAAGACTGGGAAAAGCACAGTATTTGGGCAGAGTATACTGTTCCTCCA
GGTACAGTCACTCACGCCTTTCCTTGGCTAGGAAAGGGAAATCCCCTGACCCCTTGCACT 
TCCTGGATGAGGTGACGTCCTGCCCTGCTTTGGCTCACCCTCCATGGGCTGCACCCACTG 
TCCAACCAGTGCCAATGAGATGAACCAGGTACCTCAGTTGGAAATGCAGAAATCACCCAT 
CTTCTGCATCGATCTTGCTGGGAGCTGTAGACCAGAGCTGTTCCTACTGGGGCATCTTGG 
[G, A] 

AGCAACTCTGGGTCTGAGTTTCTGTTTGTTGCCCTGATGTATATCCCCAGTGCCTAGAAT 
GATACTTGTTACATAGGAAGTGCTTGATCCATGTTTGCACAAATGAATCTTTCTCATAAT 
GAGGTTTCTCTAAACAAGCTGTTCTCCCAAAAACTTACACCCAGCTTTATGTTGAAGCAT 
CTCATTATACATTGGAAAGATGAAATGTGTAGTGAGACTTTGAATCTTCTTTTGAATCTA 

GAAACATTAGCATTTTTAGACCATTCTATTTTAATATTTATGAAATTTATGAAATAATAA (SEQ ID NO: 29) 



20362 CCTCCATGGGCTGCACCCACTGTCCAACCAGTGCCAATGAGATGAACCAGGTACCTCAGT 
TGGAAATGCAGAAATCACCCATCTTCTGCATCGATCTTGCTGGGAGCTGTAGACCAGAGC 
TGTTCCTACTGGGGCATCTTGGAAGCAACTCTGGGTCTGAGTTTCTGTTTGTTGCCCTGA 
TGTATATCCCCAGTGCCTAGAATGATACTTGTTACATAGGAAGTGCTTGATCCATGTTTG 
CACAAATGAATCTTTCTCATAATGAGGTTTCTCTAAACAAGCTGTTCTCCCAAAAACTTA 
[C,A] 

ACCCAGCTTTATGTTGAAGCATCTCATTATACATTGGAAAGATGAAATGTGTAGTGAGAC 
TTTGAATCTTCTTTTGAATCTAGAAACATTAGCATTTTTAGACCATTCTATTTTAATATT 
TATGAAATTTATGAAATAATAAGAAACATGAGGCCGGGCTCAGTGGCTTATGCCTGTAAT 
CCCAGCAGTTTGGGAGGCCAGGGCTAGTGGATCATGAGGTCAGGAATTTGAGACCAGCTT 

GGCCAACATGGTGAAACCCCACTTCTACTAAAAATATAAAAATTAGCTGGGCGTGGTGGT (SEQ ID NO: 30)



21166 TAATTCCGCCATTGTTTGCCTTGTGATCTTTGGTGCCATGTCTGTACATATTTCATGATT 
TCTGTGTTTTTACGGTTTCCATTTCAGATCTCCCTTGAGTTTAGAAATCTGGCTGAGAAA 
TACCAAACAGTGATTGCCGACATTTGCCGGAGAATGGGCATTGGGATGGCAGAGTTTTTG 
GATAAGCATGTGACCTCTGAACAGGAGTGGGACAAGGTTAGTCTCATAAAACAGTGTCTG 
TGTGTGATGTATTAGACAGAGCTGGCAGTCCTCATAGTGAAGCTCAGAACAAGAAAAGTT 
[G, A] 

TCCAGTATTTTCAGCCCCTCTGGTTTTACAATTCATCTGTTTAGGTTGAATGTCTCATCA 
TAAACAGTTTATTCCAGAGTTAATTCCAAACCAGCAGCTATGTAGGATATCAGCCAGGCT 
AGGAGTAGGGTACTGGAGAGAAGTGCTTATCTAGACAAAGGGATGTAATTGACCATGAAG 
ATTAAAACTACACATCAAAACATAAGGTAGGGTTAGGAGTCTTGCCTATTTTTCATAGGA 

ATGGTGTTTGTGAGACTTACTCATCACTTCTGTGGAAGTAAAGACATTTTATTTATTTAT (SEQ ID NO: 31) 
21477 TCAGCCCCTCTGGTTTTACAATTCATCTGTTTAGGTTGAATGTCTCATCATAAACAGTTT
ATTCCAGAGTTAATTCCAAACCAGCAGCTATGTAGGATATCAGCCAGGCTAGGAGTAGGG
TACTGGAGAGAAGTGCTTATCTAGACAAAGGGATGTAATTGACCATGAAGATTAAAACTA 
CACATCAAAACATAAGGTAGGGTTAGGAGTCTTGCCTATTTTTCATAGGAATGGTGTTTG 
TGAGACTTACTCATCACTTCTGTGGAAGTAAAGACATTTTATTTATTTATTTTAAAGCCA 
[G, A] 

TCAGATTTAGCAGGCAGAGACATTTCAGACATCTAAAGTGTTGATGTATTTCATACCTTT
AACTGTGCTTAAATTAGGATCTCCGAAAAGATGCTGCTACATGGTCACTACGTTAGTGTA 
GGTCCAAGGTCTTGGGCCTCTTAATTTTTCAAACCTCAAAACTTGACAGCAGTTATCTTT 
GGAACTGCTGATTTGTGCTTCCTAAGTTAACAGCATACAATGACTGCTAGAAATCAATTT 

CTGCATTTAAGGTGAAGTTAGCCGGGTACTATGGTTTACCTGTAATCTCAGCACTTTGGG (SEQ ID NO: 32) 

22230 GGATTGCTTGAGCCCAAGAGTTGAAGGTTGCAGTGAGCCATGATTGTGCCACTGCACTCC 
AACGTGGGTGACAGAGCAAGACACCTACTGAAAGAAAATAAAGTTGAAGTTAAAACTTCT 
GGCCAAGAACCAGCACTGGTTATGATAGTAACTCATTTTCTGTTGTGCAGATTTATTCAG 
GAAACTTAATTTTAGGTTGTTGAATAGAAGTTTTGATCAGATAAAATTGAATTAAAAAAA 
ATTTTTTTTGAGACAGGGTCTTGCTGTTATCCAGGCTGGTGTGTAGTGGTGTGATCACGG 
[C,T] 

TCCCCGCAGCCTCAACCTCCTGGGCTCAGGTGATCCTCCCACCTCAGCCTACCGAGTAGC 
TGTAACTACAGTGCATGACACCATACCAGGCTCATTTTTGTACATTTTTTGTAGAGAGAG 
GGTTTTGCCATGTTGCCCAGGCTAGTCTCAAACTCCTGGCATCAAACAGTCCTCCCACTC 
TGGCCTCTCAAATGTTGGGATTACAGGCATGACCAGCCAATTATTTCAAGGAGTTATTTT 

TTTTCTTCTACTTTGGGGGAAGATGAATTATATAAGTCTCCATTTTAGGAGTATTTCTAC (SEQ ID NO: 33) 

22941 AATTTCTGTCTACCTAATTTCAGCAAGATTTCACTCTTTTCATGTTACTTTTGTCCCAGA 
ACAAATTTCAAGTGCTTTCTCTTCACCTGTGCATTCTTCCCCCTGATTAGTCTCTGGCTT
TGTATTACTTTCAGTCAGAGACGACTTTTTTTTTTTGAGACAGGGTCTCACTCTGTCACC
CAGACTGGAATGCAGTGGCACAGACAAGGCAGCCTTGACCTTCTGGGCTCAAGCAATCTT 
CCTTGCCCTCAGCCTCCTGAGTAACTGGGACCACAGGCACGTTGCCACCATGCCTGGCTA 
[A,G] 

TTTATTTTAATTTTTATTATTTTTGAGACAGGGTATTGCTCTGTCACCCAGGCTGGAGTG 
TAGTGGCATGATCAAGGCTCACTGCAGCCTTCACCTCCTGTGCTCAAGCAGTCCTCTCAC 
CTCAGCCTCCCCATTAGCTGGGACTATAGGTCCACACCACTACACCAGGCTAATTTTTGT 
AATTTTTTGGTAGAGACAGGGTTTCATCGTGTTGCCTAGGCTGGTCTTGAGCTCCTGGGC 

TCAAGCGATTCACCTGCCTTAGCCTCCCAGGTGTGAGCCACTACACTCAGCCTTTTAAAA (SEQ ID NO: 34) 

23963 ATACTACCTAGTTTTGAACTCTTAGCCCCTGCCACAGACACGGCAGCCCCTTGAACCTTC 
CTGGGTTCAAGCGAGCCTCCTACTTCAGCCCCCTGAGTAACTGGGACCACTGGCCTGTGT 
CACTGTGCCTGGCTAATTTTTTTTTTTTCCTCACATGGGCAATGTTGGGCAAGTTAAATC 
GACTTCTTTGTGCCTCAGTTTCCTCATCTGAAATGGAGATCATACTGCTATGTACCTGAT 
ACAATGTTTGTGAGGATTGAATGTGCAGAGTTCTTTTTTTCTGTTGTTGTTGTTTTGAGA 
[C,T] 

GGAGTCTCACTCTG (SEQ ID NO: 35) 

25686 CTGAAAAATCCTTTAACTCTTGTGGTTGCGGGTGACAGAAAAACAAGCCAGGCCTCCCCC
AGGCAGCATAAGGGGATGTGGAAAATAGGATAGATTGACATGAGTTTGCTTCAGGTAGAC 
TGGCTGACTCCCAGGATTCACACCACGTAATCAGTATATTCAAGCCTTGCTGTCCTTGAT 
TTCTTTCAGACGGTCTTTCTCCAAGTGGTGGATATGGTAACAACCCACGTGCACTAGCTT 
AACAAAAAGTTCTTAGGAATGGCTTTGTTCGGCCTGGCGCAGTGGCTCATGCCTGTAATC 
[A,C] 

CAACAGTTTGAGAGGCCAAGGTGGGCGGATCACCTGAGGCCAGGAGTTCGAGACCAGCCT 
GGCCAACATAGTGAAACCCCGTGTTTACTAAAAAATACAAAAATTAGCCGGGCGTGGTGG 
CAAGGGCTTGTAATCCCAGCTACCTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGG 
AAGCAGAGATTGCGGTGAGCTCAGATTGTGCCACTGCACTCCAGCCTGGGCGACAGAGTG 

AGACTCCCTCTCAAAAGAAGAGGAAGGGCTTGGTTCTTCTGCTCAGCCCTGAATCAGTTA (SEQ ID NO: 36) 

26018 ACCTGAGGCCAGGAGTTCGAGACCAGCCTGGCCAACATAGTGAAACCCCGTGTTTACTAA
AAAATACAAAAATTAGCCGGGCGTGGTGGCAAGGGCTTGTAATCCCAGCTACCTGGGAGG 
CTGAGGCAGGAGAATCGCTTGAACCCAGGAAGCAGAGATTGCGGTGAGCTCAGATTGTGC 
CACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCCCTCTCAAAAGAAGAGGAAGGGCTT 
GGTTCTTCTGCTCAGCCCTGAATCAGTTACTGTTGCTACACAGCTGAGTTCTCTGGCCTC 
[A, G] 

CCTGGATTACGTCTACACAGTACACACAGAATGGATTTCCCCCAAAGAAAGAATTCTGCG
GCAGGAAGGGGAAAGGGATGGCAGGTAGACAAAAACTCCAGGTGTCTGTAATAAGGGACA
GGGTCGATCTTTAATTAAAACATGGACAGGGAACAGAAAGCTTTTGATACTGATTTTGTT 
CAGAAGGAAAGTAGAAAATTTTATGACTGTTCCCTGAATTTATTCCAGCATTTACCTTTT 

GCTTTCCATAAAAGTGTTTCCTGCAGCCAAGTACTTTAAAGTTTTAAAAAGACGGGTGAG (SEQ ID NO: 37)

26078 AAAATACAAAAATTAGCCGGGCGTGGTGGCAAGGGCTTGTAATCCCAGCTACCTGGGAGG
CTGAGGCAGGAGAATCGCTTGAACCCAGGAAGCAGAGATTGCGGTGAGCTCAGATTGTGC 
CACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCCCTCTCAAAAGAAGAGGAAGGGCTT 
GGTTCTTCTGCTCAGCCCTGAATCAGTTACTGTTGCTACACAGCTGAGTTCTCTGGCCTC 
ACCTGGATTACGTCTACACAGTACACACAGAATGGATTTCCCCCAAAGAAAGAATTCTGC 
[G,A] 

GCAGGAAGGGGAAAGGGATGGCAGGTAGACAAAAACTCCAGGTGTCTGTAATAAGGGACA 
GGGTCGATCTTTAATTAAAACATGGACAGGGAACAGAAAGCTTTTGATACTGATTTTGTT 
CAGAAGGAAAGTAGAAAATTTTATGACTGTTCCCTGAATTTATTCCAGCATTTACCTTTT 
GCTTTCCATAAAAGTGTTTCCTGCAGCCAAGTACTTTAAAGTTTTAAAAAGACGGGTGAG 

GCTAAGTGTGGTGTCTCATACTTATAATCCCAGTGCTGAGGCCAGGAGTTCAAGACCAGC (SEQ ID NO: 38) 

26625 TGTGGTGTCTCATACTTATAATCCCAGTGCTGAGGCCAGGAGTTCAAGACCAGCCTGAGC
AACACAGCAAGATACCATCTCTATAAAAAATTGTTAGAAAATGATTCTGCTGAAAGAGCA 
AAAATAAAAATTAAAGAAAGTAGAAAAAATAAAACTAAATTTAAAAGATTAACTGGGCAT 
GTTGGCATGCACCTGTATTCCTAGGTATTCGGGAGGCTAAGGCACAAGGATCCCTTGAGC 
GCAGGAGCTCAAGGTTGGATTGAGTTGTAATCACACCACTGCACTCCAGCCTCGGTGGCA
[C,G] 

AATGAAACTGTCTCAAGAAAAAAAAAAAGTGACAGAGGGAAACAATATTTGCAATTCATA 
GAGCAGATACAGGGTTCATATTCCTAATATTAAAAAAAACTTCTAAAAGTTAAGAAAAAG 
GCCAACTGCCCCACAGAAAAATGGGCAAGGAGATAAGAACAAGATTGTTCACAGGAAGAG
ACACACAGATGATTATTAAAAATCTGAAAAGATGCTGAGTCTTACTCCTAAGAAAAATTC 

ACATTTAAACTACTCTGGGGGCTGGGCAAGGTGGCTCACGCCTGTAATCTCAACACTGGG (SEQ ID NO: 39) 

27151 TCCTAAGAAAAATTCACATTTAAACTACTCTGGGGGCTGGGCAAGGTGGCTCACGCCTGT 
AATCTCAACACTGGGAGACCAAGGCAGGAAGATCACTGAAGCCAGGGTATCGAGACCAGC 
CTGGACAACGTAGTGAGACCTTATCTCTTAAAACAAAACAAAACAAAACAAAACAAAAAA 
AACAGTAAAAATTGGCCGGGCACAGTGACTCCTGCCTATAATCCCAGCACTTTGGGAAGC 
CCAGGTGAGTGGATCACTTGAGGTCAGGTGTTTGAGAACAGCCTGGCCAACATGGCAAAA 
[C,T] 

TCCGTCTCTACTAAAATTACAAAAATTAGCCAAGTGTGGTGGCATACGCTGGTAGGGCCA 
GCTACTTGGGAGGCTGATGTGAGACTCCATTTAAAAAAAAAAAATCAAAAATTAGCTGGG 
TATAGTGGCACACCCCTATAGTTCTCGCTCCTTGGGAGGTTGAGGCAGGAGGATTGCCTG 
AGCCCAGGAGTTCAAGGCTGCAGTGAACCATGATCACACCACTGCATTCTAGCAGCCTGG 

GAGACAGAGCAAAACCCTTGTCTCAAAACAAACAAACAACAACAAAAACAAAAAACACTT (SEQ ID NO: 40)



28032 AGGAGCAGAGCCCTGCTCTTCTCATTCACTTACTTTATCTGTAAAATAGCATCATTTCTA 
CCACACGGTGGTGGTGTGAATAAAATGAGATGAACTTCTAGCATAGAGTGCTTAGTAAAG 
GTTCTGGACATTTCGTAGTAGTTGAATCATGCCAAATGTGGTCCTAGGTGATTGGCTTCT 
TTTGCTAGCATGTTTTCAGGGCTCCTCCATGCTGGGGCATTGCATCACTGCTTTATTCCT 
TTTTATCGCCTAGTATTATTCCACTGTGTGGATAGACCACATTTATCCATTCATCAGTTG 
[G,A] 

AGGATATTTGGGTTCTTCCCATTTTTTTTGGCTATGGTGAATAGTACTGTGTACATTTGC 
ATATAAGGTTTTGTGTAGATGTGTGTTTTCCTTTTTCTTGGGTCTATGCTGAGAAGTGGA 
ATTGCTGGTTCATACAGCAGCTCGAACCTTGTGAGGAGCTGCCAGACGCTTTTCCAAGGT 
CGCTCCACCATTTTACATTCCCGTCAGCAGTGTGAGAGTCCCAGTTTCACCAGCACTTGT 

TGTTATCTCTTTTTAACTGTATGTATATATACTTAACATTTTATTTATAATAAATGTACA (SEQ ID NO: 41)

28772 AAAAATCATCAAGCCGAATCCCACTGTTAGAATTAAAGGTTTTATTTCACTTTCAAGTTA
TCAGGATCCAGGGAGGTGTAATACACTTAGAGGATAGACTCAGCTCATTTCCCAGCTATG 
CCTTTCAGCAGCATTCTTACCAGAGTAGGAATATAATGTTAGTCATTATTTAGAGGCCTG 
GCCATCTTGAGAAGGTTTACTGTTTAGTCTGCAGTACAATTATAACTGTTTTTGTATATT 
GGGTTATTTTTTTCAGAAGTAGGCCAGTAGCTCTAACAGGAGCCTCTTTAGCCTGAATTC
[G, A] 

TCCAAGTAGTGCAGTGTTGCACTAGTTGTCCCTCGGGACATGCTCCCCAATACGTAACTC 
ACTTCCAGGTTGCAACTGGACACTTACTGGTAGTCAGAAATAGCTATTGCATGGAGCTTA 
AAATGAACTTGATCTTCGTGAAAGATGAGTCTGCAGCTAAGAGACTTTACTGTATATCAT 
AGTGTTTTTTTTTGTTTTGTTTTGTTTTTGTTTTTGTGACGGAGTCTCACTCTTTCACCC

AGGCTGGAGTGCAATGGCGAGATCTTGACTCACTGCAACCTCCGCCCCCTAGGTTCAAGC (SEQ ID NO: 42)

29572 TCATAGTTCTTATGCACAAAGACCCTTTAATATTGTTTGTAAATTCTCCCCTATGCACAC 
GCTGACCTGTTCCTTAATCTTCTTATCTGTCTAGGTTTGGAGCAGGTATGTTAAGAAGTT 
AGGGGATTTTGCTAAGCCGGAGAATATTGACTTGGCCGTGCAGTGCCTGAATGAACTTAT 
AACCAATGCACTGCACCACATCCCAGATGTCATCACCTACCTTTCGAGACTCAGAAACCA 
GAGTGTGTTTAACTTCTGTGCTATTCCACAGGTAGGGAACGGGGCTCCTCTGGGTGGATA 
[C,T] 

GGGGCTAAAGGGAGTGGGGTAGGAGTAAGGGTGGATTTTGCTGTGCTATATTCAAGGATA 
TGATTCCTTAAAAAGACGATGACTCCAGTTTATTACGCTGGGAGTTTCATAGCACCCGCC 
TTTGCTTCCAGCCACCAAACTCAGCTCAGCCTTGAGGTTAAGCCTGCTCCTTTTCAGAAC 
CTTCTTTCCGGATTTACTATTTTCTACAGCTATCCTAAACTAGTTAGGTTCTTTTCCTCA 

CAGTTAAGTCAAGGTCTTTGGCTTAGATTTATGGGGAGTGCTGGGTAAAACCTGGGTGAA (SEQ ID NO: 43) 



29761 ACTGCACCACATCCCAGATGTCATCACCTACCTTTCGAGACTCAGAAACCAGAGTGTGTT
TAACTTCTGTGCTATTCCACAGGTAGGGAACGGGGCTCCTCTGGGTGGATACGGGGCTAA 
AGGGAGTGGGGTAGGAGTAAGGGTGGATTTTGCTGTGCTATATTCAAGGATATGATTCCT 
TAAAAAGACGATGACTCCAGTTTATTACGCTGGGAGTTTCATAGCACCCGCCTTTGCTTC 
CAGCCACCAAACTCAGCTCAGCCTTGAGGTTAAGCCTGCTCCTTTTCAGAACCTTCTTTC 
[C,T] 

GGATTTACTATTTTCTACAGCTATCCTAAACTAGTTAGGTTCTTTTCCTCACAGTTAAGT 
CAAGGTCTTTGGCTTAGATTTATGGGGAGTGCTGGGTAAAACCTGGGTGAAGCTGTTATC 
ATTAAAAAGTCTTCATTAAGCACCTAATTACTGCTGTCCTTTTCCTAGACCCGGCATAAA 
AAGAACCTGGTCCGGTAGACCTAGCCTCTCAGTATGCTAGGAACTTACACTTTTTAGTTG 

CCTTTACCAAGTATTGCAGATACTACTGCAAATAAGTGAAGAAAGTAACAGCATTTAACT (SEQ ID NO: 44) 



30732 ATTCTGTGTGTTGTTGAGAAAGGGAGGAGTGGGGAAGGTAAAAATCTTGACATACTTTCT 
TCGTGGGTATTTTTTCTTGAGCGATTCCATCTTAGTTGATTAGCAGTTAGCAATTGCCCA 
TTCAACAGAAGGTTTTCTTACCTTTTTGTGATAATGATAGCTAACGACATCATTTCTTCT 
TTTTTCCCTCTCTTCTTGTTGTCTCTAGGTGATGGCCATTGCCACTTTGGCTGCCTGTTA 
TAATAACCAGCAGGTGTTCAAAGGGGCAGTGAAGATTCGGAAAGGGCAAGCAGTGACCCT 
[G,C] 

ATGATGGATGCCACCAATATGCCAGCTGTCAAAGCCATCATATATCAGTATATGGAAGAG 
GTGGGTTTTTATTTAACTACTTGGATAATTTGTAGCTACTTTTATGATTTAGTAATGTCA 
CTGTTTAACCAGGTTTGGATATTAGATGATCCTAACAATTCACTATCCTGTGGCCTAAAG 
AGACAGGAATTGATATCCTTTATAAGGAAAAAAGTCTATTCACAGGAGCCGAGCAGATTG 

CTCACTGCTGTGTAGTACCCTGGTGAGAGGAGATAAATGGAGCAAGGCTGTAGGTTGGAG (SEQ ID NO: 45)



30841 GCAATTGCCCATTCAACAGAAGGTTTTCTTACCTTTTTGTGATAATGATAGCTAACGACA 
TCATTTCTTCTTTTTTCCCTCTCTTCTTGTTGTCTCTAGGTGATGGCCATTGCCACTTTG 
GCTGCCTGTTATAATAACCAGCAGGTGTTCAAAGGGGCAGTGAAGATTCGGAAAGGGCAA 
GCAGTGACCCTGATGATGGATGCCACCAATATGCCAGCTGTCAAAGCCATCATATATCAG 
TATATGGAAGAGGTGGGTTTTTATTTAACTACTTGGATAATTTGTAGCTACTTTTATGAT 
[G,T] 

TAGTAATGTCACTGTTTAACCAGGTTTGGATATTAGATGATCCTAACAATTCACTATCCT 
GTGGCCTAAAGAGACAGGAATTGATATCCTTTATAAGGAAAAAAGTCTATTCACAGGAGC 
CGAGCAGATTGCTCACTGCTGTGTAGTACCCTGGTGAGAGGAGATAAATGGAGCAAGGCT 
GTAGGTTGGAGCCCCTCAGTAGAATCATAGATTTTGAGCTGCAAGATGATGCAGGAGGCC 

AACCAAGCTTCTTGTTGCTGGTGAGGAATGTGAGGTTGAAGCTTGTCTGTGCTGATGCAG (SEQ ID NO: 46) 



31376 GAGGCCAACCAAGCTTCTTGTTGCTGGTGAGGAATGTGAGGTTGAAGCTTGTCTGTGCTG
ATGCAGTGCGTGATTGAGTGGATCTCTGGCTCCCGTCCATGTGTCCTGACACCCAGTCTG 
GTACTTTCATTATGCCACAGGCCTCAATTGAAAAATCACAGTAGGGAATTTAGGCCAAGG 
AAAGCCATCAAGTTGCAATTATTTCCTAAATTTTCTTTGGAAAATTTCATTTCAAATACC 
AAAACCATCCTATAAAAAGAAAACTTACCTTCTTAGGTCAAATCTCTAATATTTGACTAG 
[G, A] 

TTCAAAAAGTTTATTTCTGGCCAGGCACAGTAGCTTACTCCTGAAATCCCAGCACTTTGG 
GAGACCAAGGTGGGAGGATCACTTGAGGCCAGGAATTCAAGACCAGCCCGGGCGACATAG 
CAAGACCCCATTTCTACAAAAAATTTAAAAATTGTCATGGTGGTGCACGCCTGTGGTCCC 
AGCTACTCAGGAGGCTGAGGCAGGTGGATCACATGAGCCTGAGAGGTCGAGGCTACAGTA 

AGCTGTGTGATTTCATCATTGCACTCTAGCCTGGGTGATAGAGTGAGACTTTGTCTCAAA (SEQ ID NO: 47) 
32032 TCTCTAGGCCCTAGAGCAGTGGTTTGTAAATGGAGGTGATTTGCTCCCCTCCCCCCAGAG
GACATTGGACAATGTCTGGAGACATTTTTGATTGTCCTAACCGGCAGGAATCGGGTGCTA 
CTGGCATCTGGTGAGTAGAGGCCCAGGATGATGCTGTGATCCTCAGGTGTGATCCTGTTG 
AGAATGAAACACTGTAGACTTTATGAAAACATACAAGACCCTCATCATTTTTCCTTTGCC 
TGAGCTCCCTCCCCAGAGGTTACCTCTGTTCATGGTTTTGTGCATCCGTCTAGTCCCCCT 
[A, G] 

TTACGCGTTTACAGGAATATGGTTTGCAACAGTGTTTTCATCTAAATAGAATTATACAAA 
ATAGCGATTTCTGATTTCTCTTGCATATTGCACATTCTTCTTATACTTCCTCCCTACCTT 
TATCTGACACAGAAATGCTGTATGTCCAGAACTTCTATCAGAGGCACCTATGGAAGTCTA 
AGGGAAGACCACATCGCTTTTAAAAACCCTAAAATTTTGTAGTCACTAGATGAAAATATT 

CAGCCAGTGACCCAAAAAATTGCTACCAATGAGACTCTCCATTTTGCCATGTAGCCAGAA (SEQ ID NO: 48)

32525 ATCGCTTTTAAAAACCCTAAAATTTTGTAGTCACTAGATGAAAATATTCAGCCAGTGACC 
CAAAAAATTGCTACCAATGAGACTCTCCATTTTGCCATGTAGCCAGAACTTACTTTGATC 
TATGTGCCTGGGGTAGTGACCAAGTAGGTGGGTAGGAGTAATCTCAGGGAAACTTGAGGC 
CCCAGCCTCATGGCTAGGGTCATAATTTGAACCCAGGTCTGTCTGACATCAGAATCCATG 
ATGTTAACCCCAATTCTAAGGGGTTCAACTACCCTTTCTAAATGGAATCCTGCTATATTA 
[A, G] 

GCACTATTTATTCATTTTATATAAACTAGAAACATTTTATGTAGTAAGTAGTTGAGAGTG 
TTTTGGTTTTGCAGTTTGATCACTAGTTTTAGAAACCAGTTTTTAAACACTTTGTGGCCA 
ATTCCATTACTATATTAAAATTCAGATTTATTTGGTTTTTCCTTAACTATTGGGATTAAA 
TCCTGGTTGTAATTCATAGTTTGAGGGCGAGGGTGGGCAGTCTACATTTGGCTGAGCCCT 

GTTTTTGTGAATAAATGTTATCAGAACACAGCCACACCCATTTGCTTCTATGTCTTCTGT (SEQ ID NO: 49)

34179 CTGCTGTATGTAGCACAGCATTGCACAAGAGCTTATTTCAGTCTAGTAAACATTTATAGG
AGCCTGTGTCATTTAATCATCAAGCCTCGCACTGTGGCTCACACCTGTAATCCCAAAACT 
TTGGGAGGCTGAGGCAGGCAGATCACTTGAGGTAAGGAGTTCGAGACCAGCCTGGCCAAT 
ATGGCAAAACCCTGTCTCTACTAAAAATACAACATTTAGCCAGGTGTGGTGGTGCACACT 
TGTCATCCCAGCTATTCCGGAGCCTGAGACATGAGCATCGCTTGAACTCGGGAGGTGGAG 
[G,T] 

TTGTAGTGAGCTGAGATGGCACCACTGCACTCCAGCCTGGGCAACAGGGTGAAGGCCCTT 
TCTCAAACTCCTCAAGTATTTGGCTTCAACTTTATGCCGGGCATGTAGATGAAAAGTCGG 
CTATGACCTGTCCTTGACAAGCAGATGTAACTCCTTGATTGAGGCTAGTAGGTTTTTAAG 
ACCTGAATAATTGAGTTTGCAGAAACCTACTGTGTGCCTTCAGGTAAATGGAGAGTGGGG 

TTTGGTCTAGCAACGAAGCATCTAGAAGGTCTCTTTGGCCTTACCGGCTCTGTTTTAGGT (SEQ ID NO: 50) 



34249 ATTTAATCATCAAGCCTCGCACTGTGGCTCACACCTGTAATCCCAAAACTTTGGGAGGCT 
GAGGCAGGCAGATCACTTGAGGTAAGGAGTTCGAGACCAGCCTGGCCAATATGGCAAAAC 
CCTGTCTCTACTAAAAATACAACATTTAGCCAGGTGTGGTGGTGCACACTTGTCATCCCA 
GCTATTCCGGAGCCTGAGACATGAGCATCGCTTGAACTCGGGAGGTGGAGGTTGTAGTGA 
GCTGAGATGGCACCACTGCACTCCAGCCTGGGCAACAGGGTGAAGGCCCTTTCTCAAACT 
[T,C] 

CTCAAGTATTTGGCTTCAACTTTATGCCGGGCATGTAGATGAAAAGTCGGCTATGACCTG 
TCCTTGACAAGCAGATGTAACTCCTTGATTGAGGCTAGTAGGTTTTTAAGACCTGAATAA
TTGAGTTTGCAGAAACCTACTGTGTGCCTTCAGGTAAATGGAGAGTGGGGTTTGGTCTAG 
CAACGAAGCATCTAGAAGGTCTCTTTGGCCTTACCGGCTCTGTTTTAGGTAAGTCCACGT 

CTGAGTACCAGTGACTGCAGCTCTTCCAGTTGTGCTGTCATGTTTATATGTTAGAAATGA (SEQ ID NO: 51)



34451 GAGCATCGCTTGAACTCGGGAGGTGGAGGTTGTAGTGAGCTGAGATGGCACCACTGCACT
CCAGCCTGGGCAACAGGGTGAAGGCCCTTTCTCAAACTCCTCAAGTATTTGGCTTCAACT 
TTATGCCGGGCATGTAGATGAAAAGTCGGCTATGACCTGTCCTTGACAAGCAGATGTAAC 
TCCTTGATTGAGGCTAGTAGGTTTTTAAGACCTGAATAATTGAGTTTGCAGAAACCTACT 
GTGTGCCTTCAGGTAAATGGAGAGTGGGGTTTGGTCTAGCAACGAAGCATCTAGAAGGTC 
[T,C] 

CTTTGGCCTTACCGGCTCTGTTTTAGGTAAGTCCACGTCTGAGTACCAGTGACTGCAGCT 
CTTCCAGTTGTGCTGTCATGTTTATATGTTAGAAATGATCATCAAAGGACTCAAAAGTTT 
TGCCACTAATTGTATTACCGGGGACTGTCACAACCAAGATTTCTCTTAATTTATTCACCT 
TACTTATCTCCTGGAAGGGCATATTGAAGTGCTCTTGGAGTTCTCTAAAAGGGTTTTTGT 

TGGTTGTGTATATTCACTTGGGTGCCAGCGATTGATTCCAAATAAGTAAATCTTTTTTCC (SEQ ID NO: 52) 



34532 AGGCCCTTTCTCAAACTCCTCAAGTATTTGGCTTCAACTTTATGCCGGGCATGTAGATGA 
AAAGTCGGCTATGACCTGTCCTTGACAAGCAGATGTAACTCCTTGATTGAGGCTAGTAGG
TTTTTAAGACCTGAATAATTGAGTTTGCAGAAACCTACTGTGTGCCTTCAGGTAAATGGA 
GAGTGGGGTTTGGTCTAGCAACGAAGCATCTAGAAGGTCTCTTTGGCCTTACCGGCTCTG 
TTTTAGGTAAGTCCACGTCTGAGTACCAGTGACTGCAGCTCTTCCAGTTGTGCTGTCATG 
[T,C] 

TTATATGTTAGAAATGATCATCAAAGGACTCAAAAGTTTTGCCACTAATTGTATTACCGG 
GGACTGTCACAACCAAGATTTCTCTTAATTTATTCACCTTACTTATCTCCTGGAAGGGCA 
TATTGAAGTGCTCTTGGAGTTCTCTAAAAGGGTTTTTGTTGGTTGTGTATATTCACTTGG 
GTGCCAGCGATTGATTCCAAATAAGTAAATCTTTTTTCCCAAAAGGATGTAAGATGGCTT 

ATGGTTATAAGTACAACAGGCTAACAAAGTACAAGTAGATGAGAAAGTAAAATGAAGAAA (SEQ ID NO: 53) 

36541 GGTAGGAGCCAGTTGAAGGGACGTGGGAGGCGCATTCCAGAGAGAAGGAGTGGTATGAGA
CTGGAACAGAGGTGTGCAGCAGCATCGCATGGGCGAAACAACAGTAGACAGTTGTTCTTT 
TGTTTTTGTTTGTTTTTTGAGACAGGGTCTTGTTCTGTCATCCAGGCTGGAGTGCAGTGG 
CATGATCTCGGATCACTGCAACCTCCACCTCCCAGGCTCAAGTGATCTTCCCACCCCAGT 
CCCCAAGTAGCTGGGGGACCACAGGTGCATGCCACGATGCCCGGCTAATTTTTGTACATT 
[T,C] 

TGTAGAAACAGGGTTTTACTGTGTTGTCCAGGCTGGTCTTAAACGCCTGAGCTTAAGCAG 
TCTACATGCCTCAGCCTCCTGAAGTGCTGGGATTCCAAACATGAGCCACTGTGCCTGGCC 
CGGCAACTGTTACTAGACTATAGAGAGGGAGGTGGGCAAGGGCTGGTGACACTAGACAGG 
TGCAGTAGGTCTGGACCATGGGTGGCCTTGCGCTACACATTACAGAGCTCAGGCTTTTTT 

TCTCCAGGTGAGAGGGCTGGTGCCACTGAGGCATCAAGCAGAGGTTTGAGATCTCCTTGG (SEQ ID NO: 54) 



36607 CAGAGGTGTGCAGCAGCATCGCATGGGCGAAACAACAGTAGACAGTTGTTCTTTTGTTTT 
TGTTTGTTTTTTGAGACAGGGTCTTGTTCTGTCATCCAGGCTGGAGTGCAGTGGCATGAT 
CTCGGATCACTGCAACCTCCACCTCCCAGGCTCAAGTGATCTTCCCACCCCAGTCCCCAA
GTAGCTGGGGGACCACAGGTGCATGCCACGATGCCCGGCTAATTTTTGTACATTTTGTAG 
AAACAGGGTTTTACTGTGTTGTCCAGGCTGGTCTTAAACGCCTGAGCTTAAGCAGTCTAC 
[A,G] 

TGCCTCAGCCTCCTGAAGTGCTGGGATTCCAAACATGAGCCACTGTGCCTGGCCCGGCAA 
CTGTTACTAGACTATAGAGAGGGAGGTGGGCAAGGGCTGGTGACACTAGACAGGTGCAGT 
AGGTCTGGACCATGGGTGGCCTTGCGCTACACATTACAGAGCTCAGGCTTTTTTTCTCCA 
GGTGAGAGGGCTGGTGCCACTGAGGCATCAAGCAGAGGTTTGAGATCTCCTTGGTGACAG 

TGTAGAGCAGACAGGTAGATTTGGGAATTTAAGCTTAGACTCACGTTGGAGACTGAGATA (SEQ ID NO: 55) 



36681 GACAGGGTCTTGTTCTGTCATCCAGGCTGGAGTGCAGTGGCATGATCTCGGATCACTGCA
ACCTCCACCTCCCAGGCTCAAGTGATCTTCCCACCCCAGTCCCCAAGTAGCTGGGGGACC 
ACAGGTGCATGCCACGATGCCCGGCTAATTTTTGTACATTTTGTAGAAACAGGGTTTTAC 
TGTGTTGTCCAGGCTGGTCTTAAACGCCTGAGCTTAAGCAGTCTACATGCCTCAGCCTCC 
TGAAGTGCTGGGATTCCAAACATGAGCCACTGTGCCTGGCCCGGCAACTGTTACTAGACT 
[A,G] 

TAGAGAGGGAGGTGGGCAAGGGCTGGTGACACTAGACAGGTGCAGTAGGTCTGGACCATG 
GGTGGCCTTGCGCTACACATTACAGAGCTCAGGCTTTTTTTCTCCAGGTGAGAGGGCTGG 
TGCCACTGAGGCATCAAGCAGAGGTTTGAGATCTCCTTGGTGACAGTGTAGAGCAGACAG 
GTAGATTTGGGAATTTAAGCTTAGACTCACGTTGGAGACTGAGATAGCTCATCTGAGAGG 

CACTCAGGGCCTAATCTCAGGCAGTAATTTTAGGGATGTAGGGGAAGAGATGGATTCTGC (SEQ ID NO: 56) 

37493 TGACGTTTATTGGGCCTGGCACTGTGAGGTGCTGGGGATGTGAAGATCATTGTGGCTCAG
CCGCTGCTCTCGAGGGCCTCTGGGTGCAGTATGCACACCTGTGCCTCCTGTTTGCTCAGG 
AAGACAGGCTTTGAGATGAGCTGGGGCTGACATCCCCACCTTATCATTGGGATGGCTTTG 
GGTAAGTTATGTTCATGTTCTCTGAGCCTCCCTTTCCTCATTGGTAAAATGGGTATAAAA 
TACCTGCCAGTGGAGGGTTGTTGTAAGTAGCCATGGAAAATGTAAAGCACATAGCACTTA 
[C,T] 

CATTTTTTCCTGTGTCTTTAACAGATTTATCATAGAATCCCCGACTCAGACCCATCTTCT 
AGCAAAACAAGGCAGATCATCTCCACCATCCGGACGCAGAATCTTCCCAACTGTCAGCTG 
ATTTCCCGAAGCCACTACTCCCCCATCTACCTGTCGTTTGTCATGCTTTTGGCTGCCCTG 
AGCTGGCAGTACCTGACCACTCTCTCCCAGGTAACAGAAGACTATGTTCAGACTGGAGAA 

CACTGATCCCAAATTTGTCCATAGCTGAAGTCCACCATAAAGTGGATTTACTTTTTTTCT (SEQ ID NO: 57) 

37966 CTGCCCTGAGCTGGCAGTACCTGACCACTCTCTCCCAGGTAACAGAAGACTATGTTCAGA
CTGGAGAACACTGATCCCAAATTTGTCCATAGCTGAAGTCCACCATAAAGTGGATTTACT 
TTTTTTCTTTAAGGATGGATGTTGTGTTCTCTTTATTTTTTTCCTACTACTTTAATCCCT 
AAAAGAACGCTGTGTGGCTGGGACCTTTAGGAAAGTGAAATGCAGGTGAGAAGAACCTAA
ACATGAAAGGAAAGGGTGCCTCATCCCAGCAACCTGTCCTTGTGGGTGATGATCACTGTG 
[C,A] 

TGCTTGTGGCTCATGGCAGAGCATTCAGTGCCACGGTTTAGGTGAAGTCGCTGCATATGT 
GACTGTCATGAGATCCTACTTAGTATGATCCTGGCTAGAATGATAATTAAAAGTATTTAA 
TTTGAAGCACCATTTGAATGTTCGTACTAGTAGAAAATGATGTGAATTTTCTTTCTGTTC 
GGCTCCTATTTTTCTCATCATTTTGTTTTCTTTAATTGGGTTGAATGGAGTAGATAGAAA 

TATTTATGGTTTAGGTAACAGTTAGATGTTTCCTAAGAATGCAAACTGCCTTTTCCACAC (SEQ ID NO: 58)

37973 GAGCTGGCAGTACCTGACCACTCTCTCCCAGGTAACAGAAGACTATGTTCAGACTGGAGA
ACACTGATCCCAAATTTGTCCATAGCTGAAGTCCACCATAAAGTGGATTTACTTTTTTTC 
TTTAAGGATGGATGTTGTGTTCTCTTTATTTTTTTCCTACTACTTTAATCCCTAAAAGAA 
CGCTGTGTGGCTGGGACCTTTAGGAAAGTGAAATGCAGGTGAGAAGAACCTAAACATGAA 
AGGAAAGGGTGCCTCATCCCAGCAACCTGTCCTTGTGGGTGATGATCACTGTGCTGCTTG 
[T,C]

GGCTCATGGCAGAGCATTCAGTGCCACGGTTTAGGTGAAGTCGCTGCATATGTGACTGTC 
ATGAGATCCTACTTAGTATGATCCTGGCTAGAATGATAATTAAAAGTATTTAATTTGAAG 
CACCATTTGAATGTTCGTACTAGTAGAAAATGATGTGAATTTTCTTTCTGTTCGGCTCCT
ATTTTTCTCATCATTTTGTTTTCTTTAATTGGGTTGAATGGAGTAGATAGAAATATTTAT 

GGTTTAGGTAACAGTTAGATGTTTCCTAAGAATGCAAACTGCCTTTTCCACACAAAGGCT (SEQ ID NO: 59) 



38113 TCTCTTTATTTTTTTCCTACTACTTTAATCCCTAAAAGAACGCTGTGTGGCTGGGACCTT
TAGGAAAGTGAAATGCAGGTGAGAAGAACCTAAACATGAAAGGAAAGGGTGCCTCATCCC 
AGCAACCTGTCCTTGTGGGTGATGATCACTGTGCTGCTTGTGGCTCATGGCAGAGCATTC 
AGTGCCACGGTTTAGGTGAAGTCGCTGCATATGTGACTGTCATGAGATCCTACTTAGTAT 
GATCCTGGCTAGAATGATAATTAAAAGTATTTAATTTGAAGCACCATTTGAATGTTCGTA
[C,A] 

TAGTAGAAAATGATGTGAATTTTCTTTCTGTTCGGCTCCTATTTTTCTCATCATTTTGTT 
TTCTTTAATTGGGTTGAATGGAGTAGATAGAAATATTTATGGTTTAGGTAACAGTTAGAT 
GTTTCCTAAGAATGCAAACTGCCTTTTCCACACAAAGGCTGGGAATAAAATTCTGGGTAT 
TCTCGTATTCTCATTTAAAGGAGTTTAGCTTTCAGAGAGAAACAGCAGGATTGCTTTTGA

CCTTTTAGAAGATTGGTCTCCAGTAAAGGTGGACATTTTTGAGATTTTTATAATAAAGAA (SEQ ID NO: 60) 

38298 CAGGGTTTAGGTGAAGTCGCTGCATATGTGACTGTCATGAGATCCTACTTAGTATGATCC
TGGCTAGAATGATAATTAAAAGTATTTAATTTGAAGCACCATTTGAATGTTCGTACTAGT 
AGAAAATGATGTGAATTTTCTTTCTGTTCGGCTCCTATTTTTCTCATCATTTTGTTTTCT 
TTAATTGGGTTGAATGGAGTAGATAGAAATATTTATGGTTTAGGTAACAGTTAGATGTTT 
CCTAAGAATGCAAACTGCCTTTTCCACACAAAGGCTGGGAATAAAATTCTGGGTATTCTC 
[G,C] 

TATTCTCATTTAAAGGAGTTTAGCTTTCAGAGAGAAACAGCAGGATTGCTTTTGACCTTT 
TAGAAGATTGGTCTCCAGTAAAGGTGGACATTTTTGAGATTTTTATAATAAAGAATTTAA 
TTGCTCTGCATTTGTCAAGTACAGTTCGCTTGAAAGCCTGCCTGACTGTGGAAAAGATGG 
AGCTCAAGAATGGAGTTGATGGCCCAGCGTGGTGGCTCATGCCTGTAATCCCAGCACTTT 

GGGAGGCTGAGGCGGTCGGATCACGACATTAGGGGATCGAGACCATCCTGGCTAACACGG (SEQ ID NO: 61) 